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CHAPTER 1 

DISTRIBUTIONS OF GEOMETRICAL ELEMENTS 

The probability measure of geometrical elements 

1.1 In probability theory one is usually concerned with random vari- 
ables which are quantities, or sets of quantities, taking values in some 
set of possibilities on which there is defined a non-negative measure, 
satisfying certain required conditions which enable us to interpret it 
as a probability. In the theory of geometrical probabilities the random 
elements are not quantities but geometrical objects such as points, lines 
and rotations. Since the ascription of a measure to such elements is not 
quite an obvious procedure, a number of "paradoxes" can be produced 
by failure to distinguish the reference set. These are all based on a 
simple confusion of ideas but may be useful in illustrating the way in 
which geometric probabilities should be defined. 

1.2 We consider one due to J. Bertrand (1907). The problem is to 
find the probability that a "random chord" of a circle of unit radius has 
a length greater than <v/3> the side of an inscribed equilateral triangle. 

We may consider three possible solutions 
of this problem. 

(1) Any chord of the circle intersects it 
in two points, and we may suppose these 
to be independently distributed in proba- 
bility distributions which are uniform over 
the circumference of the circle. Without 
loss of generality we can suppose one of the 
two points to be at a vertex of an inscribed 
equilateral triangle. There is then just of 
Fig. 1.1 the circumference in which the other point 

can lie in order that the resulting chord has 
length greater than \/3> so that the probability is . 

(2) The length of the chord depends on its distance from the centre 
of the circle and not on its direction. We may therefore suppose that 
it has a fixed direction perpendicular to a given diameter of the circle 
and that its point of intersection with this diameter has a uniform dis- 
tribution. For the chord to have a length greater than \/3 the distance 
of the point of intersection from the centre of the circle must be less 
than J, so that the probability is J. 
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(3) Any chord is uniquely defined by the foot of a perpendicular 
on it from the centre. If this point is distributed uniformly over the 
circle the probability of it lying in any region of area A is An~ l since 
the total area of the circle is n. For the chord to have length greater 
than \/3 the foot of the perpendicular must lie inside a circle of radius \ 
and hence the probability is J. 

1.3 All three solutions are correct, but they really refer to different 
problems. In all questions concerning geometrical probability we have 
first to define what is meant by the phrase "at random' '. If we con- 
sider the chord as being determined by the angle which it makes with 
a fixed direction, and by its distance p from the centre of the circle, the 
determination of what is meant by "at random" is equivalent to deter- 
mining a joint probability density for and^> on their range of variation 
which is (0 ^ < 2n$ < p < 1). In the first solution of Bertrand's 
example let a and p be the angular coordinates of the ends of the chord, 
so that < a,/? < 2n and the joint probability distribution is 
(2n)- 2 dv.df$. We also have 

a,/? = Oicos- 1 /) 

so that by changing variables we see that the joint probability distribu- 
tion of and p must be given by 

dpdO 



Similarly in the second solution the distribution of p and 6 is given 
by 



and in the third by 

(n)~ l pdpdO. 

1.4 In general, to define the distribution of a geometric object we must 
first determine a system of coordinates which define the object uniquely, 
and then define a probability distribution on the range of these co- 
ordinates. As examples of such coordinates we may consider the fol- 
lowing : 

(1) Points in Euclidean space of one, two, three, or more dimensions 
can be defined by their Cartesian coordinates. Here the parameter space 
coincides with the space of elements. 

(2) Lines in two-dimensional space may be defined by their inter- 
section with one of the axes and the angle they make with this axis, or 
by the latter and their shortest distance from the origin. Similarly, lines 
in three dimensions can be defined by the coordinates of their intersec- 
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tion with a given plane, and their direction. They therefore require 
four coordinates for their complete specification. We could also define 
a straight line by its six Pliicker coordinates, but this is no longer a 
unique representation since the line depends only on the ratios of these 
coordinates, and in addition there is an equation connecting them. This 
makes the setting up of a probability measure in the corresponding case 
more difficult, and it is nearly always desirable to represent a geometric 
object in a coordinate space in such a way that, not only is there a one- 
to-one correspondence, but the set in the space corresponding to all 
possible objects has the same dimensionality as the coordinate space. 

(3) A plane in three dimensions can be determined by the coeffi- 
cients (, v, w) of its representation as a linear equation in Cartesian 
coordinates, 

ux-\-vy + wz+l = 0, 

or, what is often much more convenient, by its distance p from the 
origin, and the polar co-ordinates 0, (f> of the perpendicular from the 
origin on to the plane. In any case the coordinate space must be three- 
dimensional. 

(4) A translation in three dimensions may be represented by the 
three coordinates, (a, b, c) say, of the point which is taken as the new 
origin of coordinates. 

(5) A rotation in three dimensions may also be represented by three 
coordinates. These could be taken as the polar coordinates 0, <j> of the 
axis of rotation, together with the angle, y, of rotation. Alternatively, 
since any rotation in three dimensions can be represented by an ortho- 
gonal matrix, we could take any three elements of this matrix which are 
independent, since such a matrix is determined by any three such 
elements. 

1.5 We notice that in all the above examples, with the exception of 
rotation, the set of possible coordinates has an infinite measure or 
volume. On the other hand, if we place further restrictions on the geo- 
metrical objects we may obtain sets in the coordinate space which are 
bounded. Thus, for example, the coordinates corresponding to all planes 
in three dimensions which intersect a bounded figure themselves form 
a bounded set. 

1.6 If we have a fixed number of geometric elements we may define 
their joint probability distribution in the space of the coordinates. Sup- 
pose that the coordinates defining the element are (z l . . ., z k ) in a space ti 
which may or may not be bounded. Representing the coordinates by 
a vector z, we suppose that we have a non-negative measure P(E), 
defined on an additive class of sets E in the space ii, which is such that 
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P(ti) = f dP = 1. 



In most cases, P(E) will be the Lebesgue integral of a continuous or 
fairly regular function over the sets E y and the additive class can be 
taken as all those sets which are measurable in the Lebesgue sense. 

1.7 The probability distribution of a fixed number of geometric ele- 
ments which are independent can then be taken as the product of such 
distributions. However, we also need to consider cases where the num- 
ber of elements is itself a random variable. The most natural assump- 
tion to make is that the number of elements in any specified subset E 
of Q, has a Poisson distribution independently of the number in any 
other disjoint set, so that the number N of elements in E has prob- 
ability e^A^AH)" 1 , where X depends on E. This is a consistent defini- 
tion since the sum of two independent Poisson variates is itself a 
Poisson variate. 

1.8 Hence we define a measure M(E) on the space i of all possible 
parameter points. This measure will usually not be bounded, i.e. 
M(Q) may be infinite, but it will be <r-finite, that is to say that Q, can be 
decomposed into a finite or enumerably infinite number of sets E ly 
Efr..., such that 



and M(E i ) is finite for every i. Consider, for example, random points 
on a line and suppose that the number occurring in any interval of 
length 1 is a Poisson variate with mean 1, and is independent of what 
happens outside this interval. Then M(E) will be equal to the Lebesgue 
measure for any Lebesgue measurable set E on the line, and although 
M(2), the length of the whole line, in infinite, 1 can be represented as 
the sum of an enumerable number of intervals of finite length. 

1.9 Let N be the number of geometric elements whose parameters lie 
in the set E in the parameter space. AT is a Poisson variate with mean 

A = M(E) = f dM, (1.1) 

J E 

and therefore has a generating function 

dM. (1.2) 



{ 

J 



E 

Conditional on the fact that there are exactly N such elements, the 
joint distribution of their coordinates #!,...,## is 



JU 
dM( Zl )...dM(z N ) (1J) 



so that for N = I we get a relationship between the measure M(E) and 
the correspondingly induced P(E). When M(E) is not the integral of 
its derivative, (1.3) can be similarly interpreted when integrated over 
any subset, G say, of E. Then the probability of a single element having 
its coordinates in G, if it is known that they are in E, is 



J 



dM(z) 

, (1.4) 

dM(z) 



where GE is the set common to E and G. Notice that p(G \ E) is a set 
function of two sets E and G. 

1.10 A problem in geometrical probability will, therefore, not be defined 
until we have chosen a probability measure P(E) in the case of a fixed 
number of independently distributed elements, or a measure M(E) 
in the case of a random number. Such a choice is quite arbitrary in 
general, and it is the failure to recognize this fact which leads to "para- 
doxes". However, some choices of such measures are both more useful 
and more intuitively sensible than others, and we need to consider some 
criterion of choice. 

Choice of a probability measure 

1.11 Most problems in geometrical probability are concerned with 
elements in Euclidean space and with properties which are invariant 
under the group of transformations which is appropriate to Euclidean 
space, i.e. all translations, rotations and reflections. Consider then a set 
A of geometric objects in Euclidean space. We obtain all translations 
by transforming the coordinates by addition of constants, and all rota- 
tions and reflections by transforming them by an orthogonal trans- 
formation. For example in a three-dimensional space the coordinates 
Xy y, z will be transformed into new coordinates x'y y' , z' by 

X' = X-^a 



Z* = Zo + a 31 X + a 32 y + aa 3 Z, (1-5) 

where (a^) is an orthogonal matrix and the set A will be transformed 
into a new set A'. If the set of parameter points corresponding to A in 
the parameter space is E, the set corresponding to A' will be ', a trans- 
formation of the set E induced by the transformation in the Euclidean 
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space, and in the choice of P(E) or M(E) we naturally impose the con- 
dition that P(E') = P(E), or M(") --- M(E). Thus the choice of these 
measures depends on the measure-preserving properties of such groups 
of transformations. 

As particular examples we consider in paragraphs 1.12-1.20 the fol- 
lowing five cases. 

Points in Euclidean space of n dimensions 

1.12 Here the parameter space is the same as the space of elements, and 
if the measure of a set is to be invariant under translations and rotations, 
the natural measure to use is Lebesgue measure. If we are considering 
the distribution of a single point, or a finite number of points, we cannot 
allow it to range over the whole of any set with infinite measure but must 
confine it to a region of the space with bounded measure (e.g. the inside 
of a cube, or sphere). Let this region be R and its Lebesgue measure 
be m(R). Then we take P(E), the probability of the point lying within 
a set E contained in /?, to be 



If we arc dealing with an arbitrary (i.e. random) number of points, 
we suppose that the number of such points lying in a set E of finite 
Lebesgue measure is a Poisson variate with mean Xm(E) = M(E\ 
where A is a constant. If the number of such points is n y the conditional 
distribution that they lie in sets E ly ...,E n contained in E, given n, is 

m(E 1 )...m(E n ) 
m(E) n 

provided they have each been provided with suffixes so that they can 
be distinguished. 

Straight lines in two-dimensional space 

1.13 Suppose these are defined by equations of the form: 

ux + vy+l = 0. (1.6) 

Then u and v may be regarded as the coordinates of the line, and the 
parameter space is the two-dimensional space with coordinates (u, v), 
excluding the point (0, 0). The representation excludes lines passing 
through the origin, but this will not cause any difficulty. A translation 
and rotation in the two-dimensional plane can be represented by the 
algebraic transformation : 

X = a -I- x cos a y sin a, 
Y = b + x sin a +y cos a, 
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where a, b are arbitrary constants, and < a < 2n. The new position 
of the line after the transformation will be given by 

UX+VY+l = (1.7) 

where 

t/cosa-f Fsina 

M= aC/+AF+l ' (L8) 

t/sina-h Fcosa /4 rtv 



and in the new position the line must again not pass through the origin 
as otherwise the denominator in (1.8) and (1.9) would vanish. Now 
suppose we look for a measure function, M(E), defined by an integral 
of the form 

nF(u y v)dudv, 
K 

which we want to equal 

(( F(U y V)dUdV. 

J J K 

The former is equal to 



where 

du du \ 
d(u, v) _ ~dU W | 



(1-10) 

w W : 



Since equality is to hold for all sets E', we must have 



Straightforward evaluation of the Jacobian (1.10) gives 



d(U,V) 
Squaring (1.8) and (1.9) and adding, we get 



so that (1.11) will be satisfied if we take F(u, v) proportional to 
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and 



dudv 



Notice that for this technique to work we require that the Jacobian be 
of the form ^ (u y . . .) (f> (J7, . . .) ~ l . Having found M(E) we can then find 
P(E), the probability corresponding to a single line, whenever the latter 
is restricted by conditions which ensure that the coordinates u and v lie 
outside some circle of non-zero radius surrounding the origin in the 
(u y v) plane. This ensures that the integral (1.12) converges. 

1.14 In many applications it is more convenient to represent a line in 
a plane by the polar coordinates of the point where the perpendicular 
from the origin meets it. If these are (p, 9) we can write the equation of 
the line as 

x cos y sin 

-P -P 
and the differential element 

dudv 



transforms into 

dpdO, (1.13) 

because we can put 

u = -p~ l cosO, v = p~ l smO, 
so that 



d(p,0) 
and 



Notice that (1.13) corresponds to the second solution of Bertrand's 
problem. Moreover, (1.13) is nearly obvious as the natural measure 
invariant under translations, rotations and reflections. Poincare (1912) 
has shown that (1.13) is the only such differential element which re- 
mains invariant under the group of all translations and rotations. 

Straight lines in three-dimensional space 

1.15 Such lines form a four-dimensional manifold and therefore re- 
quire four parameters, It is convenient to represent them by the 
equations 

x = az+p, 

(1.14) 
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so that (a, 6, p, q) are the parameters of the line in a four-dimensional 
space. This omits the set of all lines parallel to the plane z 0, but 
as these will turn out to have zero measure this will cause no difficulty. 
We could now proceed exactly as before by considering the effect 
on these parameters of all translations and orthogonal transformations. 
However, it is of some interest to illustrate the method of finding such 
invariant measures by studying infinitesimal transformations in this case. 

1.16 Suppose that we have a space determined by n coordinates 
(#!, ...,x n ) and a continuous group of transformations which is itself 
determined by r parameters. We look for an integral of the form 



= J 



(1.15) 



which is to be invariant under any transformation of the group, and 
must therefore, in particular, be invariant under any infinitesimal trans- 
formation of the group.f 

Suppose the transformations of the group can be represented as 

x'i = fi(xi,...,x n <*i,.. .,a v ), (1-16) 

where t 1, ...,n. Then one possible infinitesimal transformation from 
x to x' will be of the form 

Xi = ** + &(*!> ~,Xn)dt (1.17) 

where dt is an infinitesimal of the first order and the f t - are to be deter- 
mined. The Jacobian of this transformation will be 



' f ! + *-* 

OX l OX 2 

A = ; 



The effect of the transformation on F will be to turn it into F-h dF 
where 

dF 



t Deltheil (1926) gives a more extended discussion of infinitesimal groups 
and their infinitesimal generators. See also Santal6 (1953). 
G.P. B 
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and so J becomes 



If/ is to remain invariant under all such transformations, we must have 

6F+F8fZ^ = Q. (1.18) 

OXj 

Inserting the value of F we get 



which is to be true for all t so that 



There will be r independent infinitesimal transformations corresponding 
to the r parameters a l9 ... 9 a r so that there are r sets of functions ^ which 
we shall denote by (ij..- w -) (j = !>> T). We therefore have r inde- 
pendent equations 



which will, in general, be sufficient to determine F. 

1.17 In the case of random lines in a three-dimensional space we have 
four coordinates (, b, p, q) for each line and we wish to find a function F 
such that 



is invariant under translations, rotations and reflections. The group of 
transformations has six parameters, and the infinitesimal transforma- 
tions can be taken as infinitesimal translations in the directions of the 
x, y, and z axes, and infinitesimal rotations about these axes. For each 
of these we have to study the effect on the coefficients a, b> p, and q. 
From (1.14) we see that the translation x' = x+dt results in 

a' = a 

and p'=p-dt\ (1.20) 

and similarly y' = y + dt results in 

V = b 

q' =q-dt. (1.21) 

The translation z' z+dt has the double effect 
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a' = a, p' = p + adt y 

V = b, q' = q + bdt. (1.22) 

The rotation dt around the z axis results in 
x' = xydt = az+p, 
y' = y-}- xdt = bz+q y 

and solving as a pair of equations in x and y, and ignoring terms which 
are of o(dt) y we find 



y = - 
so that the induced transformations are 

a' = a + bdt, 
b' = b-adt y 
p' = p + qdt y 

q' = q-pdt. (1.23) 

The rotation round the x axis given by 

dy zdt y dz = ydt 
results in the pair of equations : 

x = a(z+ydt)+p, 
y zdt = b(z+ydi) + q. 

The effect of this on the parameters is therefore given by : 

a' 



p' = 

q' = q + bqdt, (1.24) 

which are obtained by transforming the pairs of equations into the 
standard form and omitting terms of o(dt). Similarly the rotation about 
the y axis gives : 

a' = a + (l+a*)dt, 

V = b + abdt, 

p' = p + apdt y 

<f = q + bpdt. (1.25) 

Equations (1.20)-(1.25) give the values of ^ in (1.18) for each of the 
six transformations. For example, from (1.20) and (1.17) we see that 
for the first transformation, of the four values of f corresponding to 
a, b,p, q in that order, three are zero and f 3 = 1. Equation (1.19) 
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then gives 






dF 

and likewise -r- = 0. (1-27) 

dq 

(1 .26) and (1 .27) imply that F is independent of p and q. Equation (1 .22) 
then leads to 



(1.28) 



and this is identically true. The fourth equation becomes 
a a dF dF 



This immediately implies that F is a function of a 2 + b 2 only, since it 
does not vary when we vary a and b in any manner which keeps a 2 + b 2 
constant. Hence we can write F = </>(a 2 4-i 2 ), and the fifth and sixth 
equations are then both equivalent to 

(l+a 2 + b 2 )<f>' (a 2 + b 2 ) + 2<f>(a 2 + b 2 ) = 
so that 

F = (l+a 2 + 6 2 )- 2 , (1.30) 

or some multiple of this. 

This implies, just as we might expect, that all points on the plane 
z = are equiprobable (if restricted to any bounded region) and that 
all directions are equally probable. 

Planes in three-dimensional space 

1.18 We can determine such planes by the length of the perpendiculars 
on them from the origin together with the polar coordinates of these 
perpendiculars, so that their equations in Cartesian coordinates are 

tfsinfl cos<f)+y sinfl sin (f> + #cos# = p, (1-31) 

where ^ 6 ^ 7t y Q ^ <f> ^ 2n, ^ p < oo. Then intuition suggests 
that the measure invariant under all translations and rotations will be 
such that all values of p will have the same "probability" and all direc- 
tions of the perpendicular. This means that the element of measure, 
in analogy with lines in a plane, will be given by 

dj = smOd6dpd<f>. (1.32) 

Consider the representation in Cartesian coordinates which is 

ux+vy + wz+l = 0, (1.33) 
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and 

J = F(u>v,w)dudvdw. 

The translation x' = x -f dt gives the equation 

ux + vy + wz+l+udt = (I- 34 ) 

and on turning this into the form (1.21) the effect on the parameters is 
seen to he 

u' = u-u*dt, 
v r v uvdt) 
w' = w uwdt. (1.35) 

Parallel results apply for the other two translations of y and z and equa- 
tions (1.19) become 



/a a a\ 

[u* + uv-- + uw ]F = 

\ OH OV OW/ 

( a a a\ 

I uv -~ + v 2 + vw -r ]F =0 
\ on ov ow/ 



~- + w 2 --~}F = 0. (1.36) 

ou ov owl 

In a similar manner the three infinitesimal rotations will be found 
to give 



Using these three we get 

M - 1 ^ = z; -T a / = K ,-i^ = o (1.38) 

a^ a# aw 

which implies, on integration, that 



The translation conditions then give 

2^(M a + w 2 + ; 8 ) + (M a + v 2 + tt; a )f (u a -h; a + w a ) - (1.39) 
so that 
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On transforming back to polar coordinates this becomes 

smOdOdpd<f> 

as conjectured. Notice the similarity of form between (1.12), (1.30) and 
(1.40). Polya (1917) has shown that (1.22) is the only differential ele- 
ment which remains invariant under the group of all translations and 
rotations. 

Rotations 

1.19 We consider rotations around a fixed point. In two dimensions 
the theory is simple. A rotation is defined by an angle such that 
< < 2ji and the sum of two rotations will be given by the sum of 
the corresponding 0's reduced modulo 2n. The parameter space is now 
bounded and it is natural to define a measure on it by a probability dis- 
tribution of in the range (0, 2ri). The group under which the measure 
is to remain invariant is then naturally taken to be the rotation itself, 
and this forces the choice of the probability distribution which is uni- 
form on this interval. 

1.20 Rotations in three dimensions are somewhat more complicated. 
They may be defined either by choice of an axis of rotation and an angle 
through which the space or object is to be rotated, or by the choice of 
an orthogonal matrix representing the rotation. Here again the para- 
meter space is bounded, and the natural group to use is the group of 
rotations itself. The resulting theory is described in Chapter 4. 

1.21 Other classes of geometrical objects can be treated in the same 
way, provided we have a continuous group which is " transitive", which 
may be interpreted as the requirement that it is always possible to find 
an element of the group which will transfer any of the set of objects 
into any other. An example where this does not occur is given by the 
set of all circles in a plane when taken with the Euclidean group of all 
translations and rotations. It is not then possible to set up an invariant 
measure for this set since it is not possible to transform a circle into 
another circle with a different radius. 

Axiomatization 

1.22 Once the probability measure of a geometrical set is constituted, 
the solution of particular problems can proceed without the appearance 
of paradoxes or of difficulties touching on axiomatization. However, for 
the benefit of those who are more concerned with fundamentals we may 
note, in concluding this chapter, that A. Renyi (1955) and A. Csaszar 
(1955) have constructed an axiomatic theory of probability which is 
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specially relevant to problems of geometric probability. It is based on 
KolmogorofFs axiomatic foundation combined with the idea of condi- 
tional probability, which is taken as undefined. 

Suppose we are given a space in which there is a cr-field &f of sets 
A, B y . . . and let 38 be a sub-set of this cr-field. Then a conditional prob- 
ability P(A | B) is defined to be a numerical function of the two sets 
A and B which satisfies the following three axioms : 

Axiom I. P(A | B) > 0, and P(B \ B) = 1 if B belongs to &. 
Axiom II. For any fixed B, P(A\ B) is a measure. 
Axiom III. If A and B belong to <*/, and C and BC to ^, then 
P(A | BC) P(B | C) = P( AB | C). 

Then under wide conditions it can be shown that P(A \ B) can be repre- 
sented in the form 



(1.41) 
where Q(A) is a non-negative measure on the a-field t c/, and Q(B) > 0. 

1.23 This is just the sort of thing we need in geometrical probability 
where we want to deal with questions such as "what is the probability 
that a point lies within a figure K l if it is known that it lies within a 
figure K 2 " or "what is the probability that a line intersects a figure K l 
if it intersects K 2 ". Here the appropriate measure for random points or 
lines is necessarily such that it is infinite for the whole space and is 
therefore unsuitable for use as a probability, but the questions involve 
conditional distributions. Hence M(E) corresponds to Renyi's measure 
Q(B) and from it we can derive the required conditional probability 
distribution 

f dM(z) 
f dM(z) 

J E 

as in (1.9) A situation similar to this occurs in the application of prob- 
ability to number theory. 
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CHAPTER 2 

DISTRIBUTIONS OF POINTS IN 
EUCLIDEAN SPACE 

2.1 As we have shown in the last chapter, the natural measure to use 
for the distribution of points in a Euclidean space is Lebesgue measure, 
so that M(E) is n-dimensional Lebesgue measure if the points are dis- 
tributed in w-dimensional space. If the number of points is to be ran- 
dom we suppose that the number occurring in any measurable set E is 
a Poisson variate with mean AM(), where A is some constant ; and if the 
number is fixed, we suppose that the probability that it falls in a set E 19 
given that it falls in E ^ E l9 is 



Nearly all the problems of interest arise when n = 1, 2, or 3, but 
before considering them it is convenient to prove here a general formula 
due to Crofton (1885) for problems involving a fixed number of points. 

Crofton's theorem on fixed points 

2.2 Suppose that we have N points which are independently distributed 
in a domain, Z), in n-dimensional space. The measure of the space (of 
Nn dimensions) for the joint distribution is therefore {m (D) } N . Suppose 
that we wish to calculate the probability that the figure, F> formed by 
the N points has a certain property which is defined in such a manner 
that it depends only on the relative position of the points (thus the prop- 
erty must be invariant under translations and rotations), and not on 
the domain Z), or on the relative position of D and F. For example, the 
property might be defined by the condition that no two of the points are 
more than some specified distance apart. Then the probability that the 
figure F satisfies the condition is 

m*(E) ..^ 

. p =<4)r (2 - 2) 

where m* (E) is Lebesgue measure in the TVn-dimensional space, and E 
is the set of points in this space at which the figure F has the required 
property. 

Write V m(D) and let D l be another domain containing D for 
which 

m(D l ) = V+kV. (2.3) 



25 

Let the corresponding set in the Nn space be E l => E y and put 
m*(E) = [/, m^(E l ) - U l = t/+At/. 

Then the probability that Fhas the required property for random points 
in D l is 

Pl = P+AP = (2.4) 



Consider the set E v This can be divided into N+l sets ^ (j = 0, 
1, ..., N) y where Ey is the subset of E l for which j of the points lie in 
D^ D and Af / in D. Let P, be the probabilities that points in E lf 
correspond to figures in D l for which the required property holds. Then 



U l = C7+AE7 = U+ S pyF^AFy. (2.5) 

Hence from (2.4) 

(P+AP)(F+AF)" = 
so that 

AP(F+AF)" - Z ( N }(Pj-P)V N ->(AFy. (2.6) 

j-=i\7/ 

We now let AF become small. Then to the first order, writing 6P y dV 
for small increments, we have 

6P= N(P l -P)V~ l dV. (2.7) 

This is Crof ton's formula. By its use a number of awkward problems can 
be solved because if we can calculate P x , which is usually much simpler, 
we can set up a differential equation for P in terms of some parameter 
which varies the measure F of the domain D. 

2.3 A very simple example may be given of this method. Suppose A 
and B are random points on an interval of length a and we require the 
distribution of x = \AB\. Suppose the probability that x < uisF(u,a). 
Let the length of the interval have an increment da. Then from (2.7) we 
have 

dF(u,a) = 2(F 1 (u,a)-F(u y a))a-ida y (2.8) 

where J^ (u, a) is the probability that x < u when B is at one end of the 
interval. Clearly F l (u y a) = ua~ l . Inserting this and assuming F has a 
differential coefficient with respect to a y we get 



da 
so that 



(***>, a) )-2u, 
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and 

a 2 F(u y a) = 2au -f constant. 
For x = u we must have F = 1 so that the constant is w 2 , and 

F(u,a) = a- 2 {2au-u 2 }. (2.9) 

Crofton's theorem on mean values 

2.4 Further examples of this method will occur later. It may also be 
applied to finding the mean values of some numerical function, F, of the 
F. This is a function of the N sets of coordinates in the parameter space. 
Let JM be the mean of Y. Then 

p = WV~ N , (2.10) 

where V = m (Z)), and W is the integral of Y over the JVw-dimensional 
product space. We now proceed as before and obtain 

(/i + A/0(F+AF)* = W+Wi+... + W N , (2.11) 

where W k is the integral obtained by taking all cases in which there are 
k points in D l D and Nk'mD. We let m (D l D) tend to zero and 
we have asymptotically that 

W t = NfiiV^&V, (2.12) 

as AF > 0, whilst PF 2 , ..., W N are o(AF). Here [i l is the mean value of 
Y when one of the points is constrained to lie in D 1 D. Hence 

dp, = N(/^-iu)V- l dV. (2.13) 

(2.7) can be regarded as a particular case of this which is obtained when 
Y is a function defined to take the values 1 , according as the figure F 
satisfies the required condition or not. 

2.5 As an example consider the previous illustration where A and B are 
random points inside an interval of length a and we require the mean 
value of x p = \AB\ v (p > 0). If B is at one end of the interval the mean 
value of x* is 

f rt 
fa = a~ l x p dx = 

J o 

(2.13) then becomes 



and integrating we obtain 

* - (p+W+2) (2 ' 14) 

which could have been obtained directly from the previous result. 
Crofton's formulae (2.7) and (2.13) were established for random 
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points, but the same technique could be used for more complex random 
elements. 

Distribution of points on a line 

2.6 Now consider random points on an infinite line. From the discus- 
sion above we see that this is naturally defined by ensuring that in any 
set of non-overlapping intervals I ly 7 a , ... the numbers of random points 
which occur are independent Poisson variates with means A 1 7 X |, A 1 7 2 1, . . . 
where A is a constant known as the density of the distribution. Thus, 
given any point P, the probability that there is no random point in an 
interval of length a containing P, or extending from P to the right (or 
left), is 

e~ A ". 

Hence, if P is any point, and in particular one of the random points, the 
probability distribution of the distance x to the nearest random point to 
the right (or left) is 

dF = Xz~ xx dx. (2.15) 

In the language of statistical theory we may also express this by saying 
that 2Xx is distributed in the % 2 distribution with two degrees of free- 
dom. f 

2.7 We can also regard random points on a line as a realization of a 
"Poisson Process". Suppose that N(a, b) is the number occurring in the 
interval (#, b) and that this has a distribution {P n (a, 6)}, n = 0,1,.... 
Using the assumptions made above, we can derive differential equations 
for P n (a,b) by considering what may happen in an interval (a, b + d), 
where d is small. Then 

P n (a,b+d) = P n (a 9 b)(l-M) + P n _. l (a 9 b)M + o(d) 9 (2.16) 

where P n = for n < 0, and we assume that the probability of a single 
point occurring in an interval (A, b + d) is Ad + 0(<5), independently of 
what happens elsewhere. Assuming that P n (a y b) is differentiate with 
respect to b we get 

1 (a,6). (2.17) 



, n 

Solving these equations for n = 0, 1,2, ... in succession we get 

P n (a,b) = {A(ft-a)}e-*<->(!)-i. (2.18) 



f We assume here and later that the reader is acquainted with the elementary 
distributions of statistics and with the characterization of such distributions by 
moments. Cf. Kendall and Stuart, vol. I, 1958. 
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Random division of a magnitude 

2.8 A more interesting situation arises when we have a fixed number of 
random points in an interval whose length we may take as unity. These 
will divide the interval into n -f 1 random intervals which we denote in 
order along the line, together with their lengths, by I 19 . . ., / n+1 . Suppose 
that the interval is (0, 1) and that we have n random points whose coordi- 
nates X 19 ... 9 X n are independently and uniformly distributed on (0, 1). 
Then their joint distribution can be written 

dX l ...dX n (Q ^ X^ 1). (2.19) 

This can be regarded as a joint uniform distribution on an w-dimensional 
cube. Write Y l9 ..., Y n for X l9 ...,X n arranged in increasing order: 

< Y l < Y 2 < ... ^ Y n ^ 1. 

Then the cube is divided into n \ equal regions in which the density is 
uniform, and the joint distribution of the F t is given by 

n\dYi...dY*(Q < Y l < ... < Y n < 1). (2.20) 

We then have 

A = Y lt 

T V V 

2 2 J 2~ -* l> 

T V V 

-*n * n -* n 1> 

4,1 = i-y. 

The joint distribution of I l9 ... 9 I nn is singular since S/, = 1, but 
we can determine the joint distribution of 7\, ...,/ which, by calculating 
the Jacobian, is seen to be 

nidi,. ..dl n (S/, ^1) (2.21) 

i 

Each interval considered by itself therefore has the same distribution. 
This could also have been seen by observing that the n + 1 intervals are 
jointly distributed in the same way as the n+l intervals obtained by 
placing n+ 1 points at random on a circle of unit circumference. 

2.9 To determine the distribution of a single interval consider I k ^ l 
= Y k h t - Y k . Integrating (2.21) over the possible values of Y 19 . . ., Y k _, , 
and Y^g,... Y n we get 

^(\ _ X \w fc i 



for the joint distribution of Y k and 5^+!, and hence the distribution 



of I k+l s 
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-.(I-/,,.,)- 1 "".*!- (2.23) 

This is independent of A and could have been obtained directly by ob- 
serving that all the 7's have the same distribution, and (2.23) is obviously 
the distribution of I v 

2.10 We can relate the distribution of n points in an interval to a 
Poisson process in two ways. Suppose first that we have a Poisson pro- 
cess starting from zero and such that events happen at times Y l9 Y 2 > 
(0 ^ Y i < oo). We consider the conditional distribution of Y ly ..., Y n 
conditional on Y n < 1, Y n+l > 1 and show that in these circumstances 
the n points Y 19 ..., Y n are n points distributed at random in (0, 1) and 
rearranged in order. The probability that Y 19 ..., Y n are such that 
Y n < 1 , Y n , 1 > 1 and have specified values is given by 



re^rf/,, (2.24) 

i \ i / i 

where the I { are defined as before. The conditional probability will be 
obtained by dividing this expression by its integral over the region 

n 

S 7; < 1 which is clearly 

(n\)-^"e~\ (2.25) 

and the joint distribution of the 7 Z is therefore 

n\dl^...dl n (S /<<!), (2.26) 

so that the joint distribution of the Y i is 

n\dY l ...dY n , (2.27) 

which shows that the points Y v ..., Y n may be regarded as n random 
points in the interval (0, 1). 

2.11 We can also obtain a different representation which is more useful 
in some problems. Consider again a Poisson process with events occur- 
ring at times Y l9 ..., Y n , Y n+l , ..., and write 



= Y 2 



Z n = Y n F- f V (2.28) 

Then ^ Z i ^ 1 and Z x , ..., Z n are distributed in the interval (0, 1) as 
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an ordered set of n random points. To see this we observe that the joint 
distribution of Y lt ..., Y n+1 \s(Y 1 ^ ... ^ Y n+l ) 

^exp-lY n , l dY l ...dY n+l , (2.29) 

whilst the distribution of Y n+l by itself is 

(!)-i y: +1 A" +1 exp-Ay, 1+1 rfF nK . (2.30) 

Thus the conditional distribution of Y ly ..., Y n given Y n+l is 

(n\)-iY^ 1 dX 1 ...dX n , (2.31) 

and changing to Z's this becomes 

(nl)~ l dZ l ...dZ n . (2.32) 

Another representation is useful. Since Y, Y i _ l is distributed as 
(2X)- l % 2 with two degrees of freedom the joint distribution of I l9 ..., 7 nf l 
is that of the quantities 

(4-i + 4) 's*^^ f=l,..-,n + l, (2.33) 



where the jc t are independent random variables which are distributed 
normally with zero means and the same standard deviation. 

2.12 Since the joint distribution of the intervals 1^ ... / n+1 is the same 
as that of the n -f 1 quantities 

W 1 (W l + ... + W n ^)-\...,W ll+l (W 1 + ... + W, l+l )-\ say, 
where the W t are independently distributed in the same negative expo- 
nential distribution, we can give the joint distribution of the /^ another 
geometrical interpretation. Regard (W iy ..., W n] . t ) as a point in (n+ 1)- 
dimensional space. The 7 ?; depend only on the ratios of the W iy and the 
probability density of the joint distribution of the W t depends only on 
] W { . It is therefore constant on the plane S W t = 1. This plane cuts 
the positive part of the space in a regular simplex. This is the w-dimen- 
sional generalization of the equilateral triangle, regular tetrahedron, etc. 
If we now transfer our attention to this w-dimensional space for some 
fixed value of S W t and choose a scale such that the distance of each 
vertex from the opposite side is unity, the joint distribution of /!,..., / M+1 
is that of the lengths of the n+ I perpendiculars from a random point 
inside the simplex on the n+ 1 sides. 

The advantage of this representation is that one can sometimes ob- 
tain explicit formulae for probabilities by using the geometry of a sim- 
plex. On the other hand, the advantage of (2.33) is that it simplifies the 
calculation of moments. To illustrate this consider the expression 
((2.33)): 

_ A 

~' c } 
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The distribution of A/B depends only on the ratios of the x t . Since the 
joint distribution of x ly ... # 2w is spherically symmetric, the distributions 
of A/B and B are independent and hence 

MV 
3/ 



E 



B) 
and thus E( A \^^ (>0 

\ -O/ 2-j\JLj ] 

This greatly simplifies the finding of moments. 



(2.35) 



Some special problems 

2.13 We can now consider a series of particular problems of which a 
large number, mostly very special, have been constructed. Thus, sup- 
pose a unit interval is divided into three parts by two points at random 
and we ask for the probability that it is possible to construct a triangle 
having these subintervals as sides. The required probability is the prob- 
ability that no one of the intervals is greater than the sum of the other 
two. Using the simplicial representation, which is here a triangle, the 
region in which the representative point must lie for a triangle to be pos- 
sible is the region inside the triangle formed by joining the midpoints 
of the sides as illustrated in Fig. (2.1). 




Fig. 2.1 

This has an area equal to one quarter of the larger triangle so that 
the probability is J. 

The largest interval on a line 

2.14 A more interesting problem is to determine the distribution of the 
largest of the n intervals found by n 1 random points in a unit interval. 
The probability that the largest interval is greater than x 9 say, is the com- 
plement of the probability that no interval is greater than x. Let us find 
more generally the probability that exactly k intervals are greater than x, 
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where k = 0, 1, ..., and kx < 1. In a simplified form this problem was 
originally given by Whitworth (1901, Problem 667, p. 361) but is pos- 
sibly earlier in origin. We shall prove that the probability is 



*r-y (2.36) 

where the series stops at the last term for which 1 (k + s) x is positive. 
Let the intervals be numbered I ly . . . , I n and write P^ k for the prob- 
ability that Ii > x, Ij > x, . . . I k > x, whatever the size of the other in- 
tervals. Let S m be the sum of P {j k for all selections of m out of the 
n intervals. Then if the events E^ >-,E n are defined to be the events 
that /i > x y ...,I n > x, a well-known combinatorial theorem states that 
the probability that exactly k of these events occur is 

o A+r 



This may be easily established by using indicator variates (equal to 
1,0 according as an event does or does not occur) or by induction 
(Feller, 1950). 

S m is clearly f J times the probability that / x > #, ...,/ w > x, and 

using the geometrical representation of the /,- as perpendiculars on the 
faces of an (n l)-dimensional simplex we see that this is obtained by 
finding the volume of a regular simplex all of whose sides are reduced 
in ratio (1 mx). Hence S m is equal to 



if < mx < I and is zero otherwise. Then P [k] is given by 



which is equal to (2.36). 

This gives the distribution of the largest interval on putting k = 
since P f0] is the probability that the largest is less than x. This is 



-2*r 1 -... (2.37) 

The probability that the largest exceeds x is then 

-2*r 1 + ... (2.38) 
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2.15 These results were also obtained by Fisher (1929, 1940) for 
use as a test of significance in harmonic analysis. Both of these 
papers were reproduced in his collected papers with additions, and in 
the first of these he gives a table of the 5% and 1% points of (2.38) for 
n 5(1)50. In the second paper he also gives the 5% points of the 
second largest interval for n = 3(1)10(5)50. In Davis (1941) there is 
a table for nx = 0-1 (0-1) 10-0, n = 10 (10) 70 and for nx = 5-1(0-1)10-0, 
n = 80(10)300. These formulae have been applied by F. Garwood 
(1940) to the theory of the operation of vehicular controlled traffic lights, 
and he also gives another proof. The problem has also been discussed 
by Baticle (1933a, 1933b, 1935) and Levy (1939). 

2.16 A geometrical problem of an apparently different kind has been 
considered by Stevens (1939) but, as Fisher (1940) points out, it is 
exactly equivalent. On the circumference of a circle of length unity n 
arcs of length x are placed at random and we ask for the probability that 
the circumference is completely covered. Consider the arcs as being 
determined by their most clockwise points. These determine n intervals 
whose joint distribution is that of the n intervals formed by n 1 random 
points in an interval of unit length. If, and only if, none of these lengths 
is greater than x, all the points of the circumference are covered. A proof 
of the result in this form was given by Stevens. Mauldon (1951) has 
given the exact distribution of the sum of the m largest intervals for 
m 2, 3, ... 

2.17 Stevens' problem is about a set of random intervals on a circle. 
The analogous problem in three dimensions is that of finding the prob- 
ability that the surface of a sphere is completely covered by a set of ran- 
domly placed circles. No exact solution is known, but the problem and 
its biological origin will be discussed in Chapter 5. 

2.18 Another problem of some interest is the distribution of S/^. This 
was originally proposed as a test of significance for the statistical study 
of infectious diseases by M. Greenwood (1946). The exact distribution, 
like the one in 2.14, is analytic only in strips and is known exactly 
for n = 2, 3 , 4 only. The moments and the asymptotic form are given 
by Moran (1947) and the solution for n = 4 by Gardner (1952) (see also 
Moran (1951, 1953)). The moments are not difficult to find, either by 
using the representation (2.35) or by using a theorem of Dirichlet 
(Whittaker and Watson (1935), p. 258). 

lff(x) is integrable on the range (0, 1) and a r > 0(r = 1, ..., n) then 

G.P. c 
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taken over the region x t > 0, S#, < 1, is equal to 

Sai _ 1 ^ 



o 

This is easily proved by successive reduction. In particular, if f(t) 
is equal to unity, (2.39) is equal to 



If the integral was taken over the range < Zjc,- ^ C it would be 
multiplied by C^ 1 , and differentiating with respect to C we see that the 
integral of 

Y &il v ocn-l 
x l " X n 

over the (n l)-dimensional region defined by x f > 0, %x t : = 1, is 
equal to 



r ( ai +...+) 

Putting a x = . . . = a n = 1 we obtain the volume of this region to be 
equal to T (n) -1 , and then, using the representation of the n intervals 
formed by n 1 random points as the homogeneous coordinates of points 
inside the simplex, as explained before, we find that 



This was originally obtained in a very slightly different form by Whit- 

n 

worth (1901) (Proposition LVII). Applying this to the sum T = S/f 

i 
we get 

(2.44) 



, (2.45) 

and so on. 

2.19 The distributions considered above can both be used as a test for 
the hypothesis that the n 1 points are in fact distributed at random. 
Another such test was suggested by Bartlett in the discussion on Green- 
wood's paper (1946, p. 108). Since we have shown above that 7 X , ...,/ w 
are distributed as W i (W l +... + W n ) - 1 (i = 1 , . . . , n) where the W t are 
independent random variables distributed as % 2 with two degrees of free- 
dom, we could use any standard test for the homogeneity of variances. 
In particular a likelihood ratio test must be based on some function of 



35 

n // 






2 /, (2.46) 

1 = 1 

The distribution of this quantity has been extensively studied (for 
references see, for example, Kendall and Stuart (1958)). 

Kolmogoroff's test 

2.20 The theory of the distribution of random points in an interval is 
also related to another classical problem in mathematical statistics, that 
of the goodness-of-fit of a sample to a postulated theoretical probability 
distribution. Suppose that F(x) is a probability distribution which is 
continuous, i.e. has no discrete component. If x ly ... y x n is a sample in 
order of magnitude, from this distribution, define the sample distribu- 
tion function, F n (x), by 

F n (x) = 0, x < #!, 

= kn ~ , x k < x ^ x k 1 1 , 
-I, x n < x y (2.47) 

and the divergence from the theoretical distribution by 

D n = least upper bound | F n (x) - F(x) \. (2.48) 

The advantage of this criterion of divergence between F(x) and 
F n (x) is that it remains invariant when x is made to undergo any con- 
tinuous monotonic transformation. We can therefore suppose that 

F(x) = 0, x < 0, 

= x, ^ x < 1, 

= 1, x > 1. (2.49) 

The sample is then that of n random points in an interval and in the 
notation used before, 

D n = max max {| Y i -in^\,\Y i -(i-\}n^\}. (2.50) 

Ki<n 

Kolmogoroff (1933) proves that for x > 0, 

lim Pr {n*D n < x} = 1-2 S (-1)'- 1 . a- 8 **, (2.51) 

n->oo = 1 

the right-hand side being a Jacobian theta function. This distribution 
has an extensive literature, since D n has a wide application as a test of 
goodness of fit (it should, however, be noticed that the above result does 
not hold for distributions fitted to the sample by estimating para- 
meters).f The other distributions discussed above could also be used 

t See Kendall and Stuart, vol. II, 1961. 
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in this manner as tests of goodness of fit but, unlike Kolmogoroff 's test, 
require the transformation to a rectangular distribution to be carried out 
before the test is applied. 

2.21 Considered as tests of significance, the relative values of all such 
distributions must depend on the alternative hypotheses envisaged. 
Thus in the epidemiological problem considered by Greenwood, Moran 
has shown (1951) that for a very plausible alternative Bartlett's method 
using a homogeneity of variance test is better than the use of X | /,- | 2 . 
A particularly valuable and interesting survey of these and other tests, 
and of their powers, is given by Cox (1955). 

Non-overlapping intervals on a line 

2.22 Next suppose that, instead of considering random points on a line, 
we try to define random non-overlapping intervals. Consider first an 
infinite range. The occurrence of non-overlapping intervals could be 
regarded as the realisation of a random process in which t is the time 
coordinate on the range ( oo, oo). If there is no interval covering a 
point / we may suppose that the distribution of the length, T l say, to 
the nearest interval on the right is distributed in a negative exponential 
distribution, 

Ae-^'rfTj (2.52) 

with mean A" 1 , independently of what happens to the left of t v . If the 
interval then beginning has the constant length d, the line will again be 
uncovered at t-\-T l -\-6 9 and the distance to the next interval is again a 
negative exponential variate T 2 with mean X - 1 . Carrying on in the same 
way we thus define a random process. This type of model occurs in a 
number of practical situations. Thus it may be applied to represent the 
distribution of traffic along a road, or passing a given point during a 
period of time. However, in this case, if d is taken as the length of a car, 
the model appears to be satisfactory only when the traffic density is quite 
low. Better results may be had if d is taken as larger than the car length. 

The model can also represent the intervals during which telephone 
conversations are held on a line from a single subscriber, provided there 
is never any blockage and the holding time of each conversation is a 
constant equal to d. 

Finally the model can also represent the sequence of intervals during 
which a counter is occupied when it is actuated by a random sequence 
of events, such as the passage of cosmic ray particles, and is such that 
after registering each event the counter is insensitive for a period d. 

All three of these subjects have extensive literatures which are not 
really relevant to the subject of geometrical probability, and we consider 
here only one problem which arises in this model. 
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2.23 Supposing that at time / = there is no interval occurring, and 
we wish to know the distribution of the total number of intervals which 
lie within the interval (0, J 1 ). This is most easily done by finding the 
probability that n or more intervals are contained in (0, T). The end of 
the nth interval will occur at nd + T 1 -f . . . -f T n where the T l are indepen- 
dently distributed as (2A) - 1 % 2 with two degrees of freedom, and n or 
more intervals will occur in (0, T) if 7\ + . . . -f T n ^ T- nd. The prob- 
ability of this happening is therefore 

Pr {*?*,, <2A(T-,?)} 

p2>l(T-H<>) 

= 2-T(n)-* e-t* af- 1 dx. (2.53) 

J o 

Clearly more than [T(n$)~ l ] intervals cannot occur, where [x] is the 
integral part of x (compare Frenkel, 1946, p. 128). 

2.24 If we are to have exactly n intervals in (0, T) the joint distribution 
of their positions may be defined in various ways. Suppose that the in- 
tervals are of length d and that there are n of them. These will leave 
n+l random intervals I 19 ...,/, i+1 uncovered, and we may suppose that 
these are jointly distributed as the n+ 1 intervals formed in an interval 
of length Tnd by n points placed at random. From (2.21) this joint 
distribution is 



l). (2.54) 



Another definition which is not equivalent to this is to suppose that 
the intervals of length d are placed in the interval (0, 7 1 ) successively, at 
each stage all possible positions being taken as equally probable. In this 
case it is always possible to put a number n in the interval (0, T) where 
n is defined to be equal to [T(2d)~ l ] if [Td~ l ] is even, and [T(2d)~ l ] -f 1 
if [TVS- 1 ] is odd. However, if the positions are properly chosen we may 
be able to put in up to [T^- 1 ] intervals. Thus if we continue to insert 
intervals until no more can be inserted the total number will be a random 
variable whose distribution depends on T. The determination of this 
problem is a matter of some difficulty for which see Renyi (1958). A 
discrete analogue of this problem has been studied by E. S. Page (1959) 
(see also Downton, 1961) and this has some interesting physical applica- 
tions when generalized to two dimensions. 

Density of points in a plane 

2.25 Now consider random numbers of points in a plane so that the 
number occurring in any region of area A is a Poisson variate with mean 
AA. A may then be called the "density" of the points. Let P be any 
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point and r 1 ,r 2 , ... be the distances to the nearest, second nearest, and 
so on. The distribution of r l is then clearly 

2fazr l exp fctrl.dr l , (2.55) 

so that 2fair\ is distributed as # 2 with two degrees of freedom. Similarly 
r s will have the distribution 



(s-l\) -i exp - A^ 2 .rf - 1 dr s , (2.56) 

so that 2knrl is distributed as % 2 with 2s degrees of freedom. In fact the 
joint distribution of the quantities rf,r|,r|, ... is that of the distances of 
the nearest, second nearest, and so on points, on the right-hand side of 
a given point on a line on which random points are occurring with 
density AJT. This equivalence can be seen directly. 

2.26 These results may be used to provide a means of estimating the 
density of points in a plane by choosing random points and measuring 
distances to the 5th nearest neighbour. The mean value of if is there- 
fore s(tei)~ l and, if we are going to use a number of such estimates, 
it is better to add them together and take the mean as an estimate of 
A" 1 rather than add a number of biassed estimators of A. Suppose that 
y\> ->y n are n values of r\ and Y n~ l (y l + ... +y n )> Then the expec- 
tation of Y is (ATT) -1 so that n Y is an unbiassed estimator of A -1 and its 
variance is easily seen to be (wA 2 ) - 1 . As an estimate of A we may there- 
fore take AX = (nY)~ l and for n large this will have a variance A 2 /*" 1 . 
Furthermore 2Aw;t Y will be distributed as # 2 with 2n degrees of freedom, 
so confidence limits for A are easily found. 

In practice 2r l is often used to estimate A~* instead of nr\ to estimate 
A ~ 1 . From (2.55) we easily see that the mean of 2r l is A ~* and its variance 
is tt - 1 (4 n)X~ l . When a sum of 2r 1 > s is used to provide an estimate 
of A the efficiency of the procedure, relative to using r\> is therefore 
7r{4(4-7r)}- 1 = 0-9149. 

2.27 Applications of this result and various modifications of it have 
been used in estimating the densities of plants and trees, and the reader 
interested in the practical aspects should consult Greig-Smith (1957), 
Moore (1954), Cottam (1947), Cottam and Curtis (1949, 1955, 1956), 
Cottam, Curtis and Hale (1953), Morisita (1954), Clark and Evans 
(1954, 1955), and Shanks (1954). 

2.28 Some of these references do not show a very clear idea of the 
background theory, and a wide variety of modifications of such methods 
has been proposed without much theoretical analysis. In addition to 
the above method, which is known as the closest individual method, four 
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other methods will now be considered briefly as illustrating how the 
theory of geometrical probability may be applied. 

(1) Instead of choosing a point P other than one of the given set of 
random points (say, trees), we choose at random one of that set, and 
measure the distance to its nearest neighbour. The distribution is then 
the same as before. 

(2) As in the first method, a point P is chosen at random indepen- 
dently of the given points. Let Q l be its nearest neighbour, and ) 2 the 
nearest neighbour to Q v The distance QiQ% is used to provide an esti- 
mate of L It has several times been stated that this has the same dis- 
tribution as in the previous case, but this is quite incorrect, and we cal- 
culate its distribution as an illustration of the methods which can be 
applied in such problems. Let r be the distance PQ V and R the radius 
of a circle about Q l (Fig. 2.2). Then r has the distribution 

(2.57) 




Fig. 2.2 

and, conditional on the value of r, the probability that there is no random 
point inside a circle of radius R about the point Q : is 

e- A ^ (r ' *>, (2.58) 

where A (r, jR) is the area of the region inside the circle of centre Q l but 
outside the circle of centre P. Then the distribution of x, the distance 
of Q 1 to its nearest neighbour, is 



r r 
Jo 



~- 
dx 

This is awkward to evaluate explicitly, but the mean of QiQ 2 is not 
difficult to find. This is 



I"* [ 

J o J o 



. (2.60) 

dx 



A is equal to 

- r*{6 - sin cos 6} - R*{(f> - sin < cos 
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where 



= 



By partial integration (2.60) becomes 

A-* T dy 

A I ri 

Jo \l+7l~ 



(2.61) 



which can be shown by numerical integration to equal 1-191A . Thus 
0-8396r is an unbiassed estimate of A~* and its variance could be calcu- 
lated in a similar manner. 

(3) A random sampling point P is chosen and the nearest, Q ly of the 
random points is found. The distance, r, of Q l from the nearest point 
Q 2 to Q! on the other side of a line through P perpendicular to 
Fig. 2.3) is measured. 




Fig. 2.3 

The mean of the distribution of QiQ 2 could then be found by a similar 
kind of argument to that used above, but the calculations would be 
somewhat more complicated. 

(4) A random point P is chosen together with two perpendicular 
directions fixed in advance. The distances, y^y^y-^y^ say, of P from 
the nearest random point in each of the four quadrants are measured. 
From what has gone before we see that ^ny1(i = 1,...,4) are dis- 
tributed independently as # 2 with two degrees of freedom. Hence 



is an unbiassed estimator of A" 1 , with variance JA~ 2 , and confidence 
limits can be established as before. In ecological literature it is usual 
to use the mean of the distances and not their squares, but, as shown 
before, this is inefficient. 
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2.29 The reasons why such a variety of methods should have been 
suggested and used is not clear. Methods (2) and (3) are involved 
mathematically, their standard errors have not been calculated, and they 
must entail more labour. The best procedure appears to be to take ran- 
dom sampling points and measure the distance to the nearest, or sth 
nearest neighbours. If the points are not distributed at random all these 
methods will give biassed results, but the theory of such non-random 
distributions lies outside the scope of this book. 

Distribution of the distance of two points in a circle 

2.30 As an example of problems in- 
volving a fixed number of points con- 
sider first the distribution of the dis- 
tance between two points taken inside 
a circle. This has been obtained by a 
number of different writers and the 
easiest method appears to be the use 
of Crofton's theorem. 

We first obtain the mean of the 
distance between two points P, Q, 
each distributed uniformly over a 
circle of radius R (Fig. 2.4). 

Let A be a point on the circum- 
ference andy4O5the diameter. The 
points distant x from A in the elemental arc X Y occupy an element of 

/ x \ 

area equal to [2x cos - 1 _ --) dx and hence the mean distance from A is 
\ 2*R/ 

given by 




2tf 2 cos~ 1 -dx = Ocos 2 Os'mOdO = 

2R n Jo 



32R 



Let M(R) be the mean distance. Then using (2.13) we have 



(2.62) 



(2.63) 



Multiplying by /? 4 and integrating, we obtain 

128/? 5 
M(R)R* = h constant. 

v ' 45 

Since M(R) must equal zero when R = 0, we have 

M(R) - R. 



(2.64) 
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Now let p (x, R) dx be the probability that PQ lies in the interval 
(x, x + dx). Then p l (x > R) y the same probability when P lies on the 
circumference, is 



where = cos - 1 x (2R) ~ l . For fixed x we have 

tan 0*0 = R~ l dR y 
so that (2.7) becomes 

-| + 4p tan = sin cos 0, (2.65) 

00 7r# 

and the general integral of this equation is 

p = _ {0sin 2 0cos 2 0-sin0cos 3 + 0cos 4 0}-f Acos 4 0, (2.66) 
where A is a constant. Since p = when = we have 
p = _ {0 sin 2 cos 2 0- sin 9 cos 3 + 6 cos 4 0} 

7TJC J 

o 

= {0sin 2 OcosO-sin0cos 2 + 0cos 3 0), 

TT/v 

and integrating this we get 

p(x, R) = ~~{nx 2 + (R 2 - x 2 ) (n - 2a) - i(2/? 2 -h * 2 ) sin 2a} (2.67) 

where x = 2R cos a. 

The same result could have been obtained by a direct calculation. 
This is done by Borel (1925) who considers the analogous problems 
when P and Q are random points inside triangles, squares, and polygons 
in general. Clearly the distribution of PQ and its moments can be re- 
garded as still another of the many numbers which can be used to char- 
acterize convex domains in two dimensions (or analogously, convex 
domains in a higher number of dimensions). 

Sylvester's problem 

2.31 A more difficult problem is that known as Sylvester's. This is the 
problem of finding the probability that four points A, B, C, Z>, taken at 
random inside a convex domain, form a convex quadrilateral, i.e. none 
of the points is inside the triangle formed by the other three. Consider 
the complementary probability that the quadrilateral is not convex. This 
can occur in four different ways, according to which of the four points 
occurs inside the triangle formed by the other three. If the domain has 
area S and the mean area of the triangle is T, the probability of a convex 
quadrilateral is 

1-4TS- 1 . (2.68) 
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Since P is unaffected by the scale of the domain in which the four 
points lie, and since this domain is convex, we can imagine it included in 
a larger domain of the same shape and orientation. Hence using (2.13) 
it follows that dP = and hence that P = P l9 where P l is the prob- 
ability of the quadrilateral being convex when one of its points is con- 
strained to lie at random in the added infinitesimal part of the domain. 
Arguing as before, we then have 

P, = 1-37VS- 1 , (2.69) 

where T l is the mean area of a triangle one of whose points lies on the 
boundary, this mean value being averaged over all possible positions of 
the boundary point weighted by the amount by which the domain is in- 
creased there. We now get a factor 3 instead of 4 because only three 
free vertices may be chosen at random. 

The following argument could be carried out directly for any convex 
figure, but for the sake of simplicity we stick to the traditional exposition 
in terms of a convex polygon, one of whose vertices is the point, A say, 
which is the fixed vertex of the polygon. Let there be n vertices in all 
and join A to the othern 1 vertices, thus forming n 1 triangles whose 
areas are S v ..., SUi (Fig. 2.5). 




Fig. 2.5 

Let T tj be the mean area of a triangle of whose vertices one is A and 
the other two are points taken at random in the triangles i andy (where 
i = j or i =)=./) Then 

1 7\ = *S Sf TV + \ Si S, TV (2.70) 

'i+j 

Suppose first that i =f= /, and fi is a random point in S i9 C a random point 
in S,. If B is kept fixed, the mean area of the triangle ABC is clearly 
ABGj, where G,- is the centre of gravity of the triangle j. Then varying 
B the mean value of ABC is AGjG r 
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If i j we first notice that T u must be of the form XS iy where A is 
independent of the shape and size of S z . This follows because we can 
transform any triangle into any other triangle by projecting from one 
plane perpendicularly on to another and then changing the scale. In 
doing this the areas of S t and ABC are multiplied by the same factor, 
and the probabilities of points lying in corresponding regions remain 
equal. If AXY is any triangle, PFthe middle point of XY, G 15 G 2 the 
centres of gravity of AXW, AWY y and S the area of AXY, we get 

SUS = 2(15)^15+ ^(AGiGJ. (2.71) 

Moreover AG V G^ is easily seen to be *S. Hence, solving for A we get 



~ 27 
so that TV = S t . (2.72) 

2.32 By proceeding to the limit, (2.70) can be turned into an integral 
for any convex region (Fig. 2.6). Suppose that A is a point on the 




Fig. 2.6 

boundary of the region and the region is defined by the function 
p (0) (0 ^ ^ n), where p (0) is the distance of the boundary from A 
along a line making an angle with a tangent at A. Then (2.70) becomes 

sin 1 - $ \ dd d<f>. (2.73) 







Using this formula we can evaluate 7\ for any particular case. 

2.33 Consider first the triangle. Let AXY be a triangle and suppose 
that its area is to be increased by moving XY parallel to itself. Then if 
W is a point on XY distant x from X we must evaluate T when W is 



45 

the fixed vertex of the random triangle, and then average over all posi- 
tions of W obtained by taking x to be uniformly distributed on the line 
AT (Fig. 2.7). 

A 




Let S l and 5 2 be the areas of AXW and AWY and S(= S L - 
be that of AX Y. If XY -= a and the height of the triangle is A, we have 
S = \ah, S l = \xh, S 2 - \(a-x)h, and 

in the previous notation. Substituting the values of the J"s we get 

T = - { 4 x 3 -f ^ (a xY + ? fl#(fl A;) ^ (2 74) 

We then have 

(2.75) 



so that 



P= 1- 



= -| = 0-6667 



(2.76) 



2.34 For a circle we may imagine the enlarged domain to be a con- 
centric circle. All positions of the fixed point on the circumference are 
then equally likely, and since the figure is circularly symmetric no 
averaging is necessary. Let a be the radius of the circle. Then 
P = 1 3 T l (na 2 ) - 1 . A being a point of the circumference, we use it 
as the origin of polar coordinates, measuring angles from the tangent at 
A as before. Then 



f n C n 

si 
J o J o 
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Then P = l-35(\2n*)- 1 = 0-7045... . (2.78) 

Deltheil (1926) carries out a similar calculation for a general convex 
quadrilateral. For the particular case of a parallelogram or rectangle 
P = fe~ = 0-6944. It can be shown that the value for any convex 
quadrilateral lies between this value and 0-6667, the value for a triangle. 

For a regular hexagon P = 683 (972) -1 = 0-7029 and for an octagon 

2851 +2013 V2 
P ~ 4032 + 2880 V2 07030 ' (2<?9) 

Thus for any convex domain P is a characteristic number depending 
on its shape but not its size. Deltheil proves that for all convex figures 
P is not less than its value for a triangle and conjectures that for all such 
figures it is not greater than the value for a circle (or ellipse). This has 
apparently not yet been proved. 

Non-overlapping circles on a plane 

2.35 In analogy with the case of random non-overlapping intervals in 
an interval it would be desirable to define random non-overlapping 
circles, all of the same size, in a plane. Such a definition has not been 
obtained and leads to great difficulties, as the following argument shows. 
Suppose that all the circles have diameter d and one has a centre at a 
point P. No other circle can have its centre less than 2d from P, but 
the likelihood of such a circle having its centre in a small region of area 
dS at a distance R > 2d will not be proportional to dS alone and will 
not in general be a monotonic function of R. This may be seen by con- 
sidering the limiting case when the density of the circles becomes so 
large that they are nearly arranged in the manner of "closest packing", 
i.e. with their centres nearly at the vertices of a lattice of equiangular 
triangles. The probability of lying in a region dS distant R from a given 
circle will then be a strongly peaked function of R. 

2.36 Another method of approaching this problem, in the case of n 
non-overlapping circles in a given domain Z), would be to suppose that 
the circles are placed in the domain one after another, all available posi- 
tions at each stage being taken as equally probable. (This method is 
similar to that discussed above for intervals on the line.) However, it is 
then not hard to see that the distributions of the circles are not all the 
same and depend on their order in the sequence in which they are placed. 

Similar difficulties arise in the case of random non-overlapping 
spheres in space, as will be discussed in a later section. 
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2.37 When the density of the circles is low no great error will occur if 
their centres are regarded as points distributed at random on the plane, 
for the chances of two centres being closer than 2d will be small. One 
interesting application of this occurs in forestry (Bitterlick (p. 148), 
Grosenbaugh (1952a, 1952b) and Shanks (1954)). Suppose circles, 
representing tree trunks, are distributed at random sufficiently sparsely 
for the above approximation to hold and such that the expected number 
in any area dS is A dS. Suppose further that the probability distribution 
of the diameters d of the trunks is /(</). Then the mean area covered by 
tree trunks is 



( 

Jo 



x*f(x)dx. (2.80) 



Suppose that a random point P is chosen and that the number, N, 
of trees whose trunks subtend a greater angle than a at P is counted. 
Such a count can easily be made with a suitable sighting angle and 
instruments for this purpose have been constructed. A tree will thus 
be counted if its distance R is such that 

d> 2#sina. (2.81) 

Hence for a given d in the range (</, d+ dx) the average number of such 
trees counted will be 

-Y 



o -- , , (2.82) 

^2sina/ v ' 

and averaging over all values of d, the average number counted will be 

_^A_ r x z f(x)dx, 

4sin 2 a Jo 
= (sin 2 a) ~* (mean area of trunks per unit area). (2.83) 

In practice a may be chosen to be 1 44' and then ten times the observed 
number of trees will give an estimate of square feet of basal area per 
acre. 

Random points in three dimensions 

2.38 We now consider random points in. three dimensions and suppose 
again that we have a Poisson field such that the number of points occur- 
ring in any region of volume V is a Poisson variate with mean W. Let 
P be any fixed point and r lt r 2 , ... the distances to the nearest random 
point, the second nearest random point, and so on. The probability 
that there is no point in a sphere of radius r about P is 

exp-Anr 8 , (2.84) 

and so the distribution of r l is such that Xnr] has a negative exponential 
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distribution, i.e. 

4Awf expf-S-Aarftrffv (2.85) 

This may also be expressed by saying that fA^rr? has the # 2 distribution 
with two degrees of freedom. Similarly the distribution of r s is 

3(^n) s r(s)- 1 exp J-A^rJ'" 1 ^. (2.86) 

This can be seen directly by observing that the probability of a point 

occurring in the spherical region (r, r + dr) is 4A.Jir 2 dr + o(dr) and hence 

AArcr?, ^n(rl-r\) y ... (2.87) 

are distributed as the intervals to the right of a fixed point on a line on 
which random points are occurring with unit density. 

These distributions have long been known but repeatedly redis- 
covered (see, for example, Hertz (1909), Pepper (1929)). They have 
some interesting applications in Astronomy of which we consider two 
briefly. 

Olbers' paradox 

2.39 The first of these is Gibers' paradox about the brightness of the 
night sky (Bondi (1952) ). This is a paradox which arises from assuming 
that there are an infinite number of stars uniformly distributed at ran- 
dom in an infinite space. It is easy to see that the brightness of a star 
as seen by the eye, divided by the solid angle subtended by the star, is 
independent of the size and position of the star, provided the latter has 
a constant brightness at its surface. Suppose for simplicity that all 
stars have equal diameters d and the same surface brightness, and let A 
be their mean density in unit volume. If a fraction Y of the total solid 
angle 4jr is covered by star disks lying within a distance R from the 
observing point P, there will be a solid angle 4ft (1 Y) which would 
be dark if there were no stars beyond R. The expected number of stars 
in the range (R, R + dR) is ^XnR^dR and these subtend a total solid 
angle at P equal to fai 2 d 2 dR, of which a fraction Y will on the average 
be screened by nearer stars. Hence the increase in the solid angle 
illuminated due to stars in (R, R + dR) will be A(l- Y)n 2 d*dR y and 
we have 

dY 



y (2.88) 

uJK. 

so that 

log(l- F) = const. - A(l - Y)7r 2 <5 2 #, (2.89) 

and hence as R > oo, F > 1 . This argument applies also to any par- 
ticular solid angle with vertex at P, and the night sky should therefore 
be uniformly bright, which is in contradiction with observation. 
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The reader may easily verify that if there was a uniformly distributed 
interstellar dust which could act as a screen the above conclusion would 
not hold. This is not accepted as an explanation since the dust would 
be expected to reradiate the energy it receives. Bondi gives an exten- 
sive discussion of the various physical theories which have been put 
forward to explain this result. 

Holtsmark's problem 

2.40 A more difficult problem solved by Holtsmark (1919a, 1919b, 
1924) see also Chandrasekhar (1943a, 1943b, 1944a, 1944b) is that 
of the distribution of the gravitational force at a point in space due to 
a random distribution of stars. If the force is written as a vector 
F = (F 19 F 2 , F 3 ) we write this distribution as 

WWdF^dFidFi. (2.90) 

Suppose the stars are distributed in the Poisson manner in such a way 
that the expected number in any volume V has mean XV. If there are 
N stars in a sphere of radius R about the point O at which the force is 
to be measured, the force will be 

N M- N 

F=G2 p^r,- X F,, (2.91) 

1=1 I TI I t=-i 

where M t is the mass of the ith star and r t = (x iy y i9 zj) is the vector 
from to the centre of the star. We obtain the required distribution 
by letting the N stars take positions randomly distributed throughout 
the sphere and then letting R and TV tend to infinity in such a way that 

>-jr/Z 3 A = N. (2.92) 

Then if F is the force at O due to the N stars, its distribution will be 
given by 



r I"" r c- 1 "- v A ll 

7l J co J oo J-oo 



where p = (p ly p 2 , p 3 ) is a vector, and A N (p) is the characteristic func- 
tion of the distribution of F. Since F is the vector sum of the N forces 
exercised by the N stars, this characteristic function will be the product 
of the characteristic functions of the distributions of the forces exer- 
cised by each of the N stars and will be given by 

fi (""rfM, fff* e*- p T(M)<fo<<fy,<fe<, (2.94) 

i = l JO J J J |r| = 

where we have now averaged over the distribution of the masses of the 
G.P. D 
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stars, it being assumed that each such mass is independently distributed 
in a distribution with probability density r(M). 
We can therefore write A N (p) as 

- 3 j"rfM [ [ [ V r(M)dxdydz\\ (2.95) 

JTjR 3 Jo JJJ|r|-0 J 

where 

!r.- ( 2 - % ) 



Letting /? and N tend to infinity we then get 



W(V) = -~ s - r A(f)dp l dp t dp > , (2.97) 



where 



/^ f fff 00 ^1 4- 7 ^ 3 ^ 

__ vr/ ..... ,- dM \\\ &'*v(M)dxdydzy~^~'. (2.98) 

/<->oo {4jlR 3 JO J J J |r| -0 J 



|r| 

By definition 

3 



|r|-0 

so that, by subtracting, we can write A(p) as 
lim (l- 3 - [ dM\ \ \ R T(M){l-^^}dxdydzV n1 ^. (2.99) 

7?->oo L 4jT/? 3 J JJJ|r|-0 J 

<^ is O(| r ~ 2 ) but since the integral over the sphere is the integral of 
an odd function of #, y and s, the integrand is effectively O(| r | ~ 4 ), and 
hence if the integral is taken over the whole of space it is absolutely 
convergent. We can therefore write 



A(f>) = lim (i__ : Lr</Mf f r 

R->*> I 47T/? 3 Jo J J J |r| 



|r| 

= exp{-nC(p)}, (2.100 

where 



C(p) = JAf r(M){l-expip.<f>}dxdydz. (2.101) 

Jo J J J |r| -0 

Let the vector^ have components <f> lt <^ 2 , ^3 and change the variables 
of integration in C(p) from #, y, z to (f> ly < 2 , ^> 3 . It is not hard to verify 
by calculation of the Jacobian, that 

dxdydz = - \(GM)* \<f> l" 8 ^^^ (2.102) 

so that 

C(p)= IC/- 
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<M<M<A3, (2.103) 

where //| is the expected value of M*. The integrand is unaffected by 
replacing <f> by 0, so that we can write it as 

^^,. (2.104) 

We now transform to three-dimensional polar coordinates, taking 
the z axis as the direction of p, and C(p) becomes 

| 3 ff ( - *{l-cos(*0}* -*dwdtds 9 (2.105) 

Jo J -iJo 

where ^ w ^ 2ji> 1 ^ t ^ 1, and ^ z < oo. Integrating over 
w and we get 



8 | p |l (2.106) 
o 

Writing 

a = T V(2rci)Vj, (2.107) 

we get 

W(F) = cxpf-ip.F-nIpp)^^,^. (2.108) 



To evaluate this we again change to polar coordinates, using the direc- 
tion of F as the principal axis, thus obtaining 



o -1 



p|0|p|sin(|p||F|)</|p| 

= __ - _ [ exp(-a^ 3 \F\~l)xsmxdx. (2.109) 

2n | F | 3 J o 

a* has the dimensions of a force, and expressing F in terms of this as 
a unit so that | F | = /? (& the distribution of | F | becomes 

W() = H(p)/4na*p* (2.110) 

where 



In his (1943) paper Chandrasekhar gives a table of H(j$) for 
j8 = 0(0-1) 3-0(0-2) 3-4(1-0) 4-4(0-2) 10-0(5-0) 50-0(10-0) 100-0. 

2.41 In fact this distribution is not substantially different from the 
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distribution of the force at due to the nearest star alone. Using the 
previous results this is easily seen to be 



(2.112) 

which is easily tabulated. Neither of these distributions can be expected 
to be very accurate when F is large since stars have a non-zero volume 
and their centres cannot therefore approach arbitrarily close to one 
another. 

2.42 The same distributions apply in dealing with the electric field at 
a point in a gas composed of simple ions. A more difficult question in 
both these physical problems is to study F when regarded as the variate 
of a random process resulting from the random movement of the stars 
(or ions) in space. This is discussed in great detail in Chandrasekhar 
and von Neumann (1942, 1943). 

Non-overlapping spheres 

2.43 In the derivation of the Holtsmark distribution we have assumed 
that the diameters of the stars are so small compared with the average 
distance to a nearest neighbour that the assumption of a Poisson dis- 
tribution gives a good approximation. Here again, as in the case of non- 
overlapping circles, it does not seem possible to give a simple definition 
of a random distribution of non-overlapping spheres. Such a definition 
would be of considerable interest in the theory of liquids (Hildebrand, 
1944, Frenkel, 1946). 

2.44 Suppose that O is the centre of a sphere and that all spheres have 
the same diameter d. No other sphere can have its centre at a distance 
less than 2d from O. If the average density of spheres per unit volume 
is A and if the effect of the non-zero diameters is ignored, we should 
expect the average number of spheres whose centres lie at a distance in 
the interval (r, r + dr) to be 4A.Jir 2 dr. The deviation from the ideal state 
may then be accounted for by introducing a factor W(r), so that the 
expected number is 

4A^W(r)r*dr. (2.113) 

In practice, this factor can be determined for the distances between 
molecules in a liquid by X-ray analysis but is very difficult to derive 
by theoretical methods (e.g. Kirkwood and Boggs, 1942). 

2.45 An interesting experimental attempt to determine W(r) in a 
situation of plausible application was made by Hildebrand and Morrell 
(1936). A number of gelatine spheres, including a few which were 
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coloured, were placed in a cubic glass vessel containing a gelatine solu- 
tion of the same density and refractive index. The vessel was repeatedly 
shaken and photographs taken in two directions at right angles so that 
the positions of the coloured balls could be measured. In this way an 
approximate Monte Carlo determination of W(r) was made. 

2.46 As the density of the spheres increases, the distribution tends to 
a discrete distribution which is obtained by considering the set of 
distances from the centre of one sphere to its near neighbours in the 
well-known "closest packing" of spheres. This is obtained by arranging 
spheres in successive layers in each of which their centres form a 
hexagonal lattice, and such that successive layers lie in the hollows of 
the layer below. It is a curious fact that although this has been believed 
for a long time to be the closest packing of spheres no rigorous proof 
has ever been given. 

Distribution of two points in a sphere 

2.47 It is easy to invent problems involving a fixed number of points 
in three-dimensional regions similar to those already discussed above 
in two dimensions. We consider only one of these, that of the prob- 
ability distribution of the distance between two points taken at random 
inside a sphere. It is, however, convenient to consider the more general 
problem of the distribution of the distance between two points in a 
hypersphere in n dimensions. This problem is solved in Deltheil (1926) 
by using Crofton's theorem, and by Hammersley (1950) using direct 
integration. We follow another method due to Lord (1954) using 
characteristic functions. 

2.48 Let P and O be the two points in the hypersphere and suppose 
they are determined by vectors r x and r 2 . Then we require the distribu- 
tion of | r l r a | when P and Q are uniformly distributed over the hyper- 
sphere and this is the same as the distribution of | R | where R = r l + r 2 . 

Let r be the vector determining a point which has a uniform dis- 
tribution in the sphere and write P(r)dr for the probability that 
r < | r | < r + dr. If p = (p ly p 2 , /o 3 ) is an arbitrary vector, the charac- 
teristic function of r is 

(e /pr ), (2.114) 

and for a spherical distribution this is a function of | p | only, so that the 
characteristic function of r = r is 



00,)= ( 

J 



(2.115) 
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where p = | p | and 

A a (*) = r(l+a)(i*)-V.(*). (2.H6) 

This has an inversion formula 

P(r) = 2'-'<'<{ra)}- 1 JjMVi.-i('7Wp)4>- (2.H7) 

Since / a (AC) is a Bessel function, these formulae are Hankel transforms. 
For a uniform distribution in a sphere of radius a we have 

P(r) = nr"- 1 *-" (0 = r <a), 

= (r > a). (2.118) 

The characteristic function is then 



(2.119) 

The characteristic function of |r| = |fi + r 2 | is therefore the square 
of this and 

P(r) = r(l + J)(^y" J V l "y?M7|,,-.iM4. (2-120) 
Substituting in a standard formula (Watson, 1944, p. 411), we find that 



where ^ A ^ TT and sin I A = -. Put / = cos 2 1^ and this becomes 

2a 

P(r) = r t - 1 fl-/ / ,(iH+J,i), (2.122) 

where /* = 1 r 2 (4a' 2 )~ 1 and I x (p, q) is the incomplete Beta function 
defined by 



o 
If we put 2ay = r we get for n = 1,2, and 3, 

P(r) = 2(l-j), (2.124) 

P(r) = -y {cos -> - y (1 -^)}, (2.125) 

7r 

and 

P(r)= 12y(l-.y)(2+y), (2.126) 

which correspond^ to the cases for a segment, a circle, and a three- 
dimensional sphere. In particular (2.125) is another way of writing the 
previously obtained result for a circle. 
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2.49 From this we can obtain the moments, and the rth moment of 
PQ in a sphere of radius a in a space of n dimensions is 

' 

' ^ ' ' 

When n becomes large it can be shown that the distribution of PQ tends 
to normality with mean a2* and variance a 2 (2ri) - 1 . For other work on 
the problem see Watson (1959). 

Clearly this problem can be extended to other domains, and in the 
case of a cylinder the results have practical applications (see Ham- 
mersley, 1951a, 1951b, 1952). 

The distribution of the distance between points constrained to lie 
on the surface of a sphere could be considered in the same way. 
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CHAPTER 3 

RANDOM LINES IN A PLANK AND IN SPACE 

Regions determined by random lines in a plane 

3.1 We have already seen in Chapter 1 that the most natural coordi- 
nates to use for the description of lines in a plane are (/>, 0), the polar 

coordinates of the foot of the perpen- 
dicular from the origin on to the line. 
With this representation the area ele- 
ment in the parameter space which is 
invariant under Euclidean transforma- 
tions is dp d6. Hence we suppose that 
such lines are distributed "at random" 
in the sense that we have a Poisson 
field with density hdpdd. The lines 
from a field of this type will divide 

. , the plane into an infinite number of 

Fig. 3.1 it- r- -3 1 

polygonal regions, as in l H ig. 3.1. 

3.2 The probability distribution of the areas of such regions is of very 
great interest, but its analytical form is not known exactly. Some facts 
about it are, however, known (Goudsmit, 1945). One method of ap- 
proach would be to consider the intersections of a large number of ran- 
dom chords of a circle, but a more elegant method, due to Goudsmit, 
is to consider this as the limit of another problem, namely that of the 
distribution of the areas of the regions on the surface of a sphere which 
are formed by a large number of random "great circles", i.e. circles 
formed by the intersections with planes through the centre of the sphere. 
Such a circle can be defined by the coordinates of one of its "poles" on 
the surface of the sphere, and we can then suppose that each of these 
poles is uniformly distributed over the surface of the sphere. 

3.3 Suppose the sphere has unit radius, and let n be the number of 
great circles. For n equal to 1 the number of distinct regions is 2, for 
n = 2 it is 4, for n 3 it is 8, and by induction it is easily seen that in 
general the number of areas is 2 -f n (n 1) since when n is changed into 
n+ 1, 2n new regions are added. Similarly for n = 2, 3, 4 the number 
of sides (taken as equal to the number of segments of the great circles) 
is 4, 12, and 24 respectively, whilst for general n it is 2n(n- 1). Thus, 
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as n becomes large the average area is asymptotically equal to 4jm~ 2 , 
whilst the average number of sides of each quasi-polygonal region will 
be the limit of 



which is 4. (The numerator is here 4n(n 1) and not 2n(nl) since 
each segment of a great circle is a side to two polygonal areas and must 
be counted twice.) The total perimeter of the 2 + n(n 1) regions is 
twice the total length of all the great circles, i.e. it is \nn, so that the 
average perimeter of a region is asymptotically equal to ^nn~ l , and the 
average length of each side is nn - 1 . 

3.4 We now suppose that n becomes very large and consider the dis- 
tribution of regions in a small circle of radius r on the surface of the 
sphere. The number of great circles which intersect this small circle 
will be asymptotically equal to rn. Proceeding to the limit and con- 
sidering the region inside this small circle as approximately a plane 
circle, we find, on reseating, that the average area of the regions in a 
plane formed by a Poisson field with density hdpdO is (jrA 2 )" 1 . In an 
unpublished paper Mr. D. G. Kendall has also found the second 
moment of the area, but the exact distribution is still unknown. A 
similar investigation could also be made for the regions formed by 
random planes in space, as we shall see in the next chapter. 

Convex figures in a plane 

3.5 A large number of interesting problems in geometrical probability 
are connected with convex figures. For the most part they involve a 
fixed number of intersecting random lines, usually one or two. The 
resulting theories are closely related to the large subject of convex figures 
in general, of which a detailed account has been given by Bonnesen and 
Fenchel (1948) which covers the literature to 1933 (for subsequent work 
see Hadwiger, 1955). 

The theory of convex figures in geometrical probability is mainly due 
to Barbier (1860), Crofton (1869, 1877, 1885) and Sylvester (1891). 

3.6 A convex figure in a plane is a set of points, usually closed, which 
is such that if P l and P 2 are any two points in it, so also are all the points 
of the interval P^JP^ If the set is bounded, its boundary is a closed curve 
with a tangent almost everywhere. It may be defined by the pair of 
periodic functions (x(t) 9 y(t)} (0 < < 1, say) giving the coordinates of 
a point on the boundary where x (1) = x (0), y (1) = y (0). In many cir- 
cumstances it is more convenient to define it by giving its "function of 
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support" with reference to an origin O. This function, H(0\ may be 
defined as the distance (Fig. 3.2) of O from the tangent perpendicular 

to a line radiating from the point 
O in the direction 0, i.e. the 
length of the pedal in terms of 0. 
O is usually taken inside the con- 
vex figure, but if outside only a 
slight variation of the definition 
is necessary. Then H(0) is a 
periodic function of with 
period 2n, and the function 
H(6) + H(7i + 0) is the length of 
the interval which is the projec- 
tion of the convex figure on any 
line in the direction 0. This length may be described as the "thick- 
ness" of the figure in the direction 0. 

3.7 Since we have taken Mp dO (p > 0, ^ ^ 2n) as the element of 
measure in the set of all lines in a plane, the measure of the set of all 
lines intersecting a convex figure C will be 

A \*{H(0) + H(0 + n)}dO = A [~*H(0)dO. (3.1) 

Jo Jo 

We shall show that in fact this is equal to AL, the length of contour of C, 
This follows at once when we observe that when increases to + dO. 
the corresponding element of length is dL = H(0)dO. Thus we can 
write 




if. 



dpdO = L y (3.2) 

1 c 

where the integral is taken over the region in (p, 0) space for which the 
line cuts the figure. It immediately follows from this that if a convex 
figure C 2 of length L 2 is contained in a convex figure C l of length L 19 
the probability that a random chord of C x cuts C 2 is L^L^ 1 . Notice 
that this probability is independent of the position of C 2 relative to C v 
Furthermore we may suppose C 2 to consist of a straight segment of 
length /. This can be regarded as the limit of a convex region of peri- 
meter converging to 2L The probability that a random chord of C l 
intersects this segment is then 2lL^~ l . 

3.8 This can be looked at from another point of view. .Consider a 
straight segment of length / in the plane and its projection on a line in 
a direction making an angle with it. This is P(6) = 7| cos0 | and the 



mean value of P(0) is 



jr 

2n J 



/|cosO 



.* 

n 
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(3.3) 



This formula was originally due to Cauchy and has been used by Stein- 
haus (1930) as the basis of a method for measuring the length of a curve 
observed under a microscope. 




Fig. 3.3 

Suppose (Fig. 3.3) that the curve L has length L and that its total 
projection on a line in the direction is measured so that each sub- 
interval of projection is counted as many times as there are points on the 
curve which project on to it. Thus, in the case shown in the figure, the 
projection would be taken as 

P(0) = 2AB + BC+3CD + DE. 

Then, adding the contributions of each small element of the curve, we 
see that 



L = 



(3.4) 



where P is the mean of P(0) taken over all directions. For any given 
direction P(6) is easily found if the microscope has a calibrated moving 
stage. To find P such measurements could be taken with n values of 
spaced 2nn~ l apart. When n is odd this is the same as taking the mean 
of 2n observations spaced nn ~ l apart, so we consider only the case where 
2n observations are taken 2n(2n)~ l apart. An exact upper bound for 
the error is then easily obtained by using elementary vector analysis, for 
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if P 2n is the mean based on 2n observations, 

-i / TT \ -i 

(3.5) 



~ o n -- . 

2n \ 2n/ ~ \ 2n/ 

Even for small w, the resulting error is small. 

3.9 Instead of supposing that the figure O 2 lies inside C l suppose that 
it lies entirely outside C\. We determine again the probability that a 
random chord of C l intersects C 2 . Draw the four common tangents 
AD, CD, EG and FH, the latter two intersecting between the contours 
in a point O (Fig. 3.4). Define the convex figures 1\ and F 2 to consist 



A 




Fig. 3.4 

of the figures C l and C* 2 together with regions interior to the two tan- 
gents from O to each, so that I\ is AEOHD and F 2 is BFOGC. Let L 12 
and L' 12 be the lengths of chords tightly drawn about C l and C 2 and either 
not crossing over, or crossing over at O. Then we can write 

(Measure of chords of Fj) + (Measure of chords of F 2 ) = (Measure 
of all lines which meet either I\ or F 2 )-f( Measure of all lines which 
meet both F 5 and F 2 .) (3.6) 

The left-hand side of this equation is equal, from what has been said 
before, to the sum of the perimeters of F x and F 2 , i.e. to L' 12 (taking 
A = 1). The measure of the lines which meet either T 1 or F 2 or both is 
L 12 . Thus the measure of the lines which meet both is L' 12 L 12 , and 
since the measure of all chords which meet C l is its perimeter L l the 
required probability is 

(L'v-L^L^. (3.7) 

3.10 Similarly we may consider two convex figures which overlap. The 
number of intersections of the two boundaries may be any even number, 
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but by applying the above type of argument we can see that the prob- 
ability that a random chord of C l intersects C 2 is 

(Lj + L.-L^Lr 1 , (3.8) 

where L l and L 2 are the perimeters of C l and C 2 and L 12 is the length 
of a chord tightly drawn about both, i.e. the length of the "smallest 
convex cover" of both C and C 2 . 

3.11 Now consider pairs of lines intersecting a convex figure C of 
length L. Since the measure of each is L and they are taken as indepen- 
dent the appropriate measure* is L 2 . To obtain the probability that two 
random chords of C will intersect inside C we must find the measure of 
all pairs of lines which satisfy this condition. More generally, consider 
another convex figure C l contained in C (and possibly identical with it) 
and determine the probability that two random chords of C : intersect 
inside C 2 . Let AB be a chord of C l which intersects C 2 and consider 
the measure of the set of chords of C l which intersect AB inside C 2 . If 
.v is the length of the intercept of AB by C 2 the measure will be 2x, and 
the probability that a random chord of C L will intersect this intercept is 
2xL l ~ l . If (/>, 0) are the coordinates of the line making the chord AB 
the probability element associated with this line is L^dpdO, and hence 
the overall probability that the two chords intersect is 

P - ( f 2xL i -*dpdO (-00 < p < ao,Q ^ ^ 2n\ (3.9) 

where the integral is taken over all positive and negative values of p and 
all values of 0, such that the line intersects C 2 . Integrating first with 
respect to p, we see that 

xdp = A 2 , the area of C 2 . (3.10) 



Thus P = 2jiA 2 L l - 2 . (3.11) 

As a special case we may assume C and C 2 coincide, so that 

P = 2nA l L l ~\ (3.12) 

and for a circle P is equal to | . 

Since P < 1 we have 2n A l < L. This is reminiscent of the classi- 
cal isoperimetric inequality which states that for any closed non-over- 
lapping curve with A as its internal area and L as the length of its 
perimeter, 

4jtA < L\ (3.13) 

f Deltheil takes L 2 since he does not give different identities to the two 
lines, but it is a little clearer to distinguish them and hence each configuration 
of two lines is counted twice. 
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From this it follows that the probability of intersection of two random 
chords of a convex figure never exceeds |, the value for a circle. 

3.12 The other interesting thing about (3.11) is that it shows that the 
probability depends only on the area of C 2 and not on its position or 
shape. Hence if we have a large number of random chords of C x their 
intersections will be uniformly spread over the figure with a "density" 
2n. 

3.13 (3.11) can also be used to give another proof of Goudsmit's result. 
Take a circle of unit radius for simplicity and suppose it has n random 
chords. If we add another chord the expected number of intersections 
is \n. If we count as regions all those areas which have part of the cir- 
cumference for a boundary, the number of new areas resulting when there 
are N intersections is N+l, and hence the total expected number of 
areas with n random chords is 

(3.14) 



The number with curved boundaries is O(ri) and hence it does not 
matter whether we count them or not. The asymptotic number of areas 
is Jw 2 , and from this we deduce the result obtained before to the effect 
that a Poisson field of lines with density element Xdp dO divides the plane 
into regions of average area (zd 2 )- 1 . 

Crofton's theorem on convex figures 

3.14 We next consider the density of the intersections of chords of a 
convex figure outside the figure. We do this by determining the prob- 
ability that two random chords of C x will intersect inside a convex figure 
C 2 (outside C x ) which is such that its area and dimensions are small 
(Fig. 3.5). 




Fig. 3.5 
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Let AB be the intersection with C 2 of a chord of C\. From A and B 
draw tangents of C v Then from (3.7) the measure of all chords of C r 
which also intersect AB is the difference between the lengths of chords 
drawn about C l and AB y either crossing or not. Since C 2 is vanishingly 
small, the tangents from A and B are parallel in pairs. Let BK and BH 
be the perpendiculars from B on the tangents through A. The required 
measure is then 2ABAHAK. Let (9, O ly 2 be the directions of AB, 
AK and AH so that 

2AB-AH-AK = 4J3[2-cos(0-0i)--co3(0-O a )]. 

Integrating over all possible values of the parameters defining the line 
AB y we get 

f ' [ AB{2-cos(0-0 1 )-cos(0-0 2 )}dpdO. (3.15) 

Integrating over p we get 



= area C 2 , (3.16) 

so that the "density" of intersections of pairs of chords of C l is 

ro* 

Jo, 
= 2[(0 1 -0 8 )-sin(0 1 -0 1 )}. (3.17) 

The measure of all pairs of chords of C x which intersect in a region D 
outside Cj is therefore given by 

2\ { (a-sinaWffrfy, (3.18) 

J D J 

where a is the angle subtended by C l at the point (x,y). In this formula 
D could be taken as the whole of the plane outside C t . Since the measure 
of all chords of C 1 is L 2 , we have 

2 [ f (a-sina)<fo</v = L*-2nd > 0, (3.19) 

J J fl_(7 

where ii is the whole plane. This is Crofton's "first" theorem on convex 
figures. 

3.15 Lebesgue (1912) has given an alternative derivation of (3.19). 
Consider straight lines / and /' which are possibly restricted to lie in cer- 
tain sets of lines E and E' y and suppose that their equations are written 

(/) # cosO-f ysinO = p, 
(/') #cos0'+j;sin0' =/>' 
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Then the measure of their points of intersection is 

[[ [[ dpdQdp'dO'. 

Keeping and 0' constant, we can regard/) and/)' as functions of (x,y), 
the coordinates of the intersection of / and /'. The Jacobian of the trans- 
formation is 



3(p,p') 



d(x,y) 



cos sin 
cos 6' sin 0' 



= | sin (0'- 0)|, 
and the integral becomes 

f [ | [ [ | sin (6' - 0) | dd dO'\ dx dy, 

so that the "density" of the intersections at the point (x, y) is the integral 

\sm(0'-6)\dOdO' 



taken over the corresponding domain of values of and 6'. When the 
sets E and E" consist of lines intersecting the convex domain C ly and 0' 
will both vary over a range to 6 Q + a. Since the integral is independent 
of Q we can take the latter as zero and obtain 

PW(f sin (0'- 0)dO- T sm(0'-0)dO 

J U J 6' 

= f " {1 - <x>s0'- cos (a- 0' 

J 

- 2a-2sin0, 
as before. 

3.16 In Lebesgue's method of proving Crofton's "first" theorem of 
convex figures a measure was set up for pairs of lines in the plane. The 
same type of method can be used for pairs of points. Suppose we have 
two points P and P' with coordinates (x,y) and (x' y y') restricted to lie 
in two sets E and " (which may be the same). Then the natural measure 
of pairs (P, P') will be 

Jf J f dxdydx'dy'. (3.20) 

Let #cos0-f;ysinO = p 

and xsinOycosO = 

be the equations of the lines PP' and the perpendicular on PP' from the 
origin of coordinates. If (f , rj) is a point of the plane, its distance from 
the line #sin0 ycosO = is given by f sinfl 77 cos with a suitable 
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convention as to the sign of this distance. Hence we can replace the set 
of coordinates (x,y, x' y y r ) of the pair P, P' by the four coordinates 
(p, 0, p, p'\ where 

x = psinO+pcosd, 
y = p cos +p sin 6, 
x f = p'sinO+pcosd, 
y' = p' cos 6 +p sin 0, 

and the Jacobian is easily found to be p' p. Thus the above integral 
becomes 

and the corresponding range of integration is often simpler than the 
original one. 

Crofton's second theorem on convex figures 

3.17 To illustrate this, suppose that C is a convex figure of area A. 
Then the measure of all pairs of points inside the figure is A 2 , each pair, 
in the geometrical sense, being counted twice. If AB is a chord, the 
relative density of pairs of points on this chord will be given by 

\p'- P \dpdp', (3.22) 

where the pairs have corresponding coordinate value (p y p'). If a and b 
are the values of these corresponding to A and B the integral is easily 
verified to be equal to \l\ BJ where 1 AB is the length AB. Thus the in- 
tegral representing the measure of all pairs of points in C can be written 



if J 



(3.23) 



where 1 AB is the length of the chord intercepted by the line whose polar 
coordinates are (/>, 0). We have already shown that this measure is equal 
to S 2 , and so we obtain the equation 

- 3S 2 , (3.24) 

which is Crofton's ' 'second* ' theorem. Notice in particular that the 
dimensionality of the formula is correct, both sides being of the fourth 
order in length. 

3.18 This result is easily generalized to obtain the mean of the nth 
power of the distance apart of two points chosen at random in a convex 
contour. Proceeding as before, this mean value is 
G.P. E 
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rlAB 



IAB 

^ 

(3 ' 2S) 



For a circle the mean distance between two points is thus 

128R 



* 



C H 
J. 



as obtained before (Chapter 2). In a similar way we could obtain the 
characteristic function of the distribution of the distance of two points. 

3.19 Crofton has given other theorems (1877) on mean values of which 
the following is typical. Consider a convex figure C which has a centre 
O, i.e. relative to this centre H(0) = H(0 + TT), and every chord through 
O is bisected at O. Let be the direction of such a chord and G the 
centre of gravity of one of the two halves into which the figure is divided 
by the chord. Then we shall show that the mean distance of points in 
the figure from O is one-quarter the length of the curve traced out by G 
as increases from zero to 2n. 

Let X Y be the chord through O and suppose that #, y are the co- 
ordinates of G referred to OF as jc-axis. If XOY is rotated through a 
small angle to a new position X'OY' it is clear that the new position 
of G, relative to the old axes, is given by 

x' = x + dx = A- l {Ax+$aw + %aw}, (3.27) 

where A is the area of the half-figure, a is the distance OX O F, and 
w is the "weight" associated with the regions XOX' and YO Y'. Clearly 

w = 
and 



r 

= 

Jo 



fc 



Similarly y' = y + dy = y. 

Thus G is displaced parallel to XO Y, and if we denote an element of 
its path length by ds we have 

ds = dx = %a*A- l dO, 
so that the radius of curvature of this path is 

~ 
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where 1(6) is the length XY = 2a. Then the length of the path traced 
out by G is 

M = f ds = (6A)~i ( l(0)*dO, (3.28) 

whilst the mean distance of points in the figure from O is 

= {M. (3.29) 



This result can also be proved by calculating, in two ways, the prob- 
ability that a random chord and a random point in the figure are such 
that the centre of the figure and the random point lie on opposite sides 
of the chord. 

Applications in ecology 

3.20 The theory of the intersection of random lines with convex or 
other figures has been applied by G. A. Mclntyre (1953) to the estima- 
tion of plant cover or areal density in ecology. The figures may be of 
general shape, or convex, or circles, and the stronger the assumptions 
which can be legitimately made about them the stronger the conclusions 
that can be drawn. The method used is to place a straight line, of length 
L, called a ' 'transect' ', at random on the area to be surveyed and to 
measure its intersections with the regions which might, for example, be 
patches of some particular type of vegetation. 

Suppose first that the regions to be estimated are of any shape and 
that, on the average, they occupy a fraction P of the total area which we 
shall take as so large that edge effects can be neglected. The regions, 
which are non-overlapping, are not necessarily distributed "at random" 
in any sense. Suppose that they intersect the transect in intervals of 
lengths li (i 1, 2, ...). Then L~ l %li is an unbiassed estimator of P. 
For consider any element dx of the transect. The probability that this 
lies inside some region is P, and since the expectation of the sum of a set 
of quantities is the sum of the expectations, the expectation of the sum 
S/ z will be LP. An estimate of the variance of the estimate can be ob- 
tained by repeating the process a number of times and calculating the 
sample variance. 

3.21 What is of more interest is to be able to estimate the number of 
regions per unit area in the total area, and it is clear that this cannot be 
done without making further assumptions about the shape of the regions. 
We shall suppose them to be circles whose diameters 2R have a frequency 
distribution <f> (2R) and are such that the expected number of circles with 
diameters in the range (2/?, 2 (R -f dR)) which lie in a large area A is 
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so that the expected total number of circles in A is 



(3.30) 




Fig. 3.6 



3.22 The transect will form a chord or part chord with any circle which 
it intersects. We shall first suppose that we count only those intersec- 
tions with the transect which are whole chords, together with any part 
chords at one specified end of the transect. Let the resulting intervals 
be denoted by /(, 1' 2 Then circles of diameter D giving rise to inter- 
vals counted in this manner must lie in a region of the type shaded in 
Fig. 3.6 which has area 2RL. 

The probability that the transect intersects a given circle will be 
larger if the circle is large and therefore the probability distribution 
of 2R for circles which intersect the transect will be different from 
2(f> (2R) dR. In fact the expected number of intersections of the transect 
with circles whose diameters lie in the range 2R,2(R + dR) will be 

4LRty(2R)dR (3.31) 

which we shall write as 2f(2R) dR. This is not a probability but is pro- 
portional to one, the factor of proportionality being obtained by integra- 
tion, so that we get 

R ( f>(2R)dR 



f(2R)dR 

r/(2R)dR 

Jo 

as this probability distribution. 



I 



(3.32) 



3.23 It is now convenient to modify again the rule for counting inter- 
sects. We now count all complete chords but omit part chords from 
one specified end of the transect, and complete any part chords at the 
other end. Denote the resulting set of intervals by /J 7 , /' Then the 
expectation of S l[ ' is 



J o 



Rf(2R)dR, 



(3.33) 
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because the average length of intersection with a circle of radius R is 
\nR. We do not know <f> (2R) or f(2R) but the last expression can be 
written as 



J Rf(2R) dR f JJ R ~* 



because 

f *R- l f(2R)dR = 2LA ^(2/?)^ = LA. (3.35) 

Hence 



*- 1 )}- 1 , (3-36) 

where E f means the expectation relative to the distribution of diameters 
of circles which do intersect the transect. We can estimate E f (R) and 



and N~^D^\ (3.37) 

where N is the number of intersections with the transect, and D x , Z) 2 , ... 
are the (measured) diameters of the circles making the intersections 
/{', /g", .... Thus an estimate, by no means unbiassed, of A will be given 
by the expression 

- 1 . (3.38) 



3.24 We can in fact replace this by an even simpler estimate using only 
the values of the intercepts /'. For if a line intersects a circle of radius 
R the probability distribution of the length of intercept / is clearly 
given by 

"' <3.3 



From this we have E(l^ 1 ) - nR-\ (3.40) 

(/,) = frR. (3.41) 

Hence we can replace the above estimator by 

-.. (3.42) 

These estimators are dependent on the assumption that the regions 
are circular. In particular, any estimator based on expressions such as 
S (Q -1 will fail for regions having angles. Various modifications of these 
techniques are considered by Mclntyre for dealing with regions which 
do not depart too widely from circles. 




70 

3.25 We may also point out that it is possible to extend this theory to 
the estimation of the probability distribution 2</> (2R) dR from the fre- 
quency distribution of the l\, and we shall consider this type of problem 
in detail in the next chapter in connection with the estimation of the 
probability distribution of the radii of sets of spheres in three-dimen- 
sional spaces from the observed diameters of their intersections with 
random planes. 

Buffon's problem 

3.26 We may end our discussion of random lines in a plane by con- 
sidering the classical problem which was the beginning of this theory 
that of Buffon's needle (Buffon, 1777). 

A needle length L is placed at random on a plane on which are ruled 
parallel lines at unit distance apart and the probability is required that 
the needle intersects these lines. Suppose first that L < 1 so that only 
one intersection is possible. Let be the angle made by the needle with 
the direction of lines, and x the distance of its centre from one of them. 
Then for x uniformly distributed over the range ( -, |), and 6 uni- 
formly distributed over (0, 2n), the probability is 



r2n 

= (2*H 1 

J o 



LI sin (9 dO = 2jt~ l L. 



(3.43) 



If L is greater than unity consider the probability of at least one inter- 
section. This is 



1 r sl 

= -\ 

nj o 



sin- 1 lr l 



{\-L\smO\}dO 



n 



( 3 - 44 ) 



If the needle is thrown N times and in R of these throws at least one 
intersection is obtained, P = RN' 1 is an unbiassed estimator of P with 

Table 3.1 



Experimenter 


Needle 
length 


Throws 


Hits 


Estimate 


Wolf, 1850 


0-8 


5000 


2532 


3-1596 


Smith, 1855 


0-6 


3204 


1218-5 


3-1553 


De Morgan, c. 1860 


1-0 


600 


382-5 


3-137 


Fox, 1884 


0-75 


1030 


489 


3-1595 


Lazzerini, 1901 


0-83 


3408 


1808 


3-1415929 


Reina, 1925 


0-5419 


2520 


859 


3-1795 


Gridgeman, c. 1960 


0-7857 


2 


1 


3-143 
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variance N~ l P(l P). Hence it is possible to estimate n by experiments 
of this kind. Many such experiments have been made and are sum- 
marized and discussed by Gridgeman (1960) in a very interesting paper. 
Table 3.1 gives some details of these, the needle length being given as 
a fraction of the interlinear distance. 

3.27 The excellent results of these experiments are mainly due to an 
adroit use of "optional stopping", combined, in Lazzerini's case, with 
the good fortune that the estimate of n came to 355(1 13) "^ a well- 
known approximation to n discovered by Tsu Ch'ung-chih. If the true 
value of n was unknown before the experiment, so that optional stopping 
could not be used, a better method of estimating n is to cut out a large 
circle of wood and use a tape measure. 

3.28 Buffon's problem can be looked at in various ways and several ex- 
tensions have been considered, some of which throw light on a variety 
of other probability problems. For instance, suppose that a figure con- 
sisting of a line of length L, twisted into a definite shape, is thrown on 
a set of parallel lines, unit distance apart. Then the expected number 
of intersections is 2n ~ L /, independently of the shape. This is really (3.4) 
looked at in another way. 

3.29 Buffon's problem can also be treated (for L < 1) by using the 
previous results on convex figures. For suppose the needle is surrounded 
by a circle of unit diameter whose centre is the mid-point of the needle. 
This circle will intersect the lines in exactly one line whose intersection 
will be a random chord of the circle. From what has been proved before 
(3.1) it follows that the probability that this chord intersects the needle 
is 2L/n. 

3.30 Although n is naturally best found by quite other methods than 
an experimental realization of Buffon's problems, such Monte Carlo 
methods are often necessary in other problems, and to illustrate the 
important technique of "antithetic variates" Hammersley and Morton 
(1956) discuss a variation which is of great interest in suggesting similar 
methods in other applications. 

3.31 Suppose that the needle has unit length, and, in order to simplify 
the algebra, let the distance between the parallel lines be so small that 
a count of the number of intersections is equivalent to measuring the 
length of the projection of the needle in a direction parallel to the parallel 
lines. Thus each observation is a random variable X = |sin0 1, where 
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6 is uniformly distributed on (0, 2ri). Then X is an unbiassed estimator 
of 2n~ l with a coefficient of variation equal to V(i^ 2 ~~l) = 0-4834. 



3.32 If T is a variate used as an estimator in a Monte Carlo method, 
an antithetic variate T' is one with the same expectation as T and which 
is dependent on T in such a way that T' is large when T is small and 
vice versa. Adding the two together, we may hope for a much improved 
estimate. Hammersley and Morton illustrate this principle by consider- 
ing the projection of another similar needle fixed at right angles to the 
first. We then have 

x r = Hl sin0 l + l cos<9 l} ( 3 - 4S ) 

as an unbiassed estimator of 2n~ l with a coefficient of variation which 
can be verified to be equal to VfiV^ + s 7 *" 1} = 0*0977. Thus even if 
there is twice as much labour in each throw in the second method, the 
efficiency is 12-2 times the first method. This idea can be further ex- 
tended by using n needles in the form of a star with angles nn ~ l between 
successive needles. The coefficient of variation of the mean projection 
can then be shown to be 

jz 2 (12nV5)- l {l+0(rc- 2 )}. (3.46) 

3.33 This method would also apply when the distance between the 
parallel lines is not small enough to justify the above approximation, but 
the calculations then get considerably more complicated and we shall 
describe here only the classical case of a single needle. Suppose as before 
that the distance between the parallel lines is unity and that the needle 
has length L > 1 . Write P , P l9 P 2 , . . . for the probabilities of 0, 1, 2, ... 
intersections. We have already found P in (3.44). Suppose that 
L = n+l y where / < 1. Then there can be 0, 1, ..., n+ 1 intersections. 
Define a^ag, ...,a n to be angles such that 

Lsinaj = 1, Lsinoc 2 = 2,..., Lsinoc n = n. 

Consider now P n+1 . In order to obtain n + \ intersections, must 
lie between a n and \n (Fig. 3.7). 

For fixed we then see by considering two possible positions of the 
end of the needle that the probability of n + 1 intersections is L sin n. 
P n+1 is therefore equal to 

- f * (L sin 6 - n) = 2Ln ~ l cos a n + 2nn ~ l (a n - \n). (3 .47) 

n Ja n 

For k intersections to occur (0 < k < n+ 1) it is necessary that 6 
should lie in the range (<*>k-i> &k+i) an ^ we must consider the cases 
(<*k-i < ^ a fc )and(a fc < ^ a fc+1 ) separately. In the first, the condi- 
tional probability of k intersections is Lsin0-(&- 1), and in the second 
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Fig. 3.7 

it is (& + 1)-L. Thus the total probability is 
P k = 2 



n 



2L. 
H {cos a k+ ! I cos ajj. + cos a ft _! }, 



(3.48) 



except when = n for which we have to put a fc+1 = 



3.34 It is also possible to consider the probability of a needle, thrown 
at random, intersecting one of two sets of parallel lines which are at right 
angles to each other. If they are distant a and b apart and L < a, L < b 
then the probability that the needle crosses at least one line is 
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(3.49) 



This and various similar results are easily obtained (see amongst 
others Deltheil, 1926, Mantel 1953). 

Random lines in three dimensions 

3.35 The theory of random straight lines in three-dimensional space 
leads to a variety of problems some of which are analogous to those con- 
sidered above. We have already seen in Chapter 1 that if lines are repre- 
sented by four coordinates (0, i, />, <?), so that a line is determined by the 
equations 

x = az+p, 

y = bz + q, 



poo r oo 

Jo Jo 
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then the appropriate measure to use is defined by the element 

(l+a 2 +b*)-*dadbdpdq. (3.50) 

This differential element implies that if we suppose that lines are 
defined by their intersection P with some plane, then P must be uni- 
formly distributed at random over the plane and, independently of this, 
all directions are equally likely. Consider then the measure of the set of 
all lines which intersect a certain measurable set E lying in some plane 
which we can take as the plane Z = 0. Then the measure of such lines 
will equal the measure of E multiplied by the integral 

dadb 

(l-}-# 2 -t-A 2 ) 2 

Hence the measure of all lines intersecting a plane region of area A is 
n A. Since almost any line meets a three-dimensional convex region in 
two points, the measure of all lines meeting such a figure is ^nS, where 
S is the surface area of the figure. This can be interpreted in another 
way. Let P(n) be the area of the projection of the convex region on any 
plane with normal n, so that P, the mean of this projection for all n, is 

P = (4jr)- 1 [fp(n)&>, (3.52) 

where do) is the element of solid angle corresponding to n. Then 2jtP is 
the measure of all lines meeting the convex region (because each line is 
counted only once, whatever its "direction", and the possible directions 
fill a solid angle of size 2n), and equating this to the previous value we 
have S = 4P y which is "Cauchy's formula" for the area of a convex 
body. This provides a useful practical method for measuring the sur- 
face areas of small convex bodies such as stones. This method is the 
three-dimensional analogue of Steinhaus* method described earlier in 
this chapter, and a similar set of inequalities can be established for finite 
approximations (Moran, 1944). In fact, if P is the true mean value 
given by (3.52) and P 19 P 2 are the means of the projections of the convex 
body on the faces of a dodecahedron and icosahedron respectively, then 

0-91758 ^ P,P~ l ^ 1-07869 

0-9560 < P 2 P~* < 1-0472, (3.53) 

these limits being five- and four-figure values of expressions which are 
best possible limits. 

3.36 The "isepiphanic" theorem states that of all closed surfaces 
in three-dimensional space which are topologically equivalent to a 
sphere, and for which a surface area can be defined, the ratio S 3 V~ 2 
(V = volume) is a minimum for a sphere. This is a very difficult 
theorem to prove in general, but it is relatively easy when confined 
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to the class of convex surfaces. In that case it may be put in the form 
that P 3 V~ 2 is a minimum for a sphere. Now, for any closed surface, 
S > 4P where P is the mean projection on a random plane (each point 
in the projection being counted once only, even if more than two points 
of the surface project on to it) and it may therefore be conjectured that 
for any set E in space whose projection P(n,E) in the direction n has 
a surface measure which is a measurable function of n, .P 3 F~ 2 is a mini- 
mum for the sphere, where P is the mean value of P(n, E) in all direc- 
tions and V is the three-dimensional Lebesgue measure of E. This has 
not been proved. 

3.37 The second moment or better the variance of P(n) for a con- 
vex body is also a number which characterizes the convex body. It is 
clearly zero for a sphere and has been calculated for a circular cylinder 
and a rectangular parallelepiped by Walters (1947). 

3.38 A further consequence of the above results is that if K l is a convex 
region contained in a convex region K 2 , the probability that a random 
secant of K 2 intersects K is S^ S 2 ~ l y where S ly S 2 are the surface areas 
of K l and K 2 . In particular the probability that a random chord of a 
sphere intersects a given diametral plane inside the sphere is 2n (4-n) ~ l = i, 
since the circular area formed by the diametral plane can be regarded as 
a flat convex region with surface area 2n, the radius being taken as unity. 

3.39 With the probability element (3.50) we have, in effect, defined the 
line by its intersection with a fixed plane, and its direction. The above 
discussion suggests that it would also be valuable to define a line first by 
its direction and then by its intersection with some plane perpendicular 
to its direction, this representation being analogous to the use of the 
polar coordinates of a line in the plane. Thus, consider the plane 

ax + by + Z = 
which is perpendicular to the line given by 

x = az+p, 
y = bz+q. 

The direction cosines of the line are 

/ = a(l+0 2 + 6 2 )-*, m = 6(1+ a 2 + &*)-*, and n = (l+a 2 + i 2 )-*. 

The direction being fixed, an element of area in this plane will be 
got by varying p and q and is given by 

dS = (1 + a 2 + 6 2 )* dp dq = n ~ l dp dq. (3.54) 
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To obtain the element of solid angle, write 

/ = sin cos <, 
m = sin 6 sin <, 
n = cos#, 

so that the element of solid angle is 



dfl = sinOdOdt = r . (3.55) 

w i 

The Jacobian of / and m with respect to 6 and < is 

cos cos ( sin 6 sin ^ 
cos0sin</> sin cos ^ 
= cos sin 0, (3.56) 

so that the element of solid angle is n~ l dldm, and taking the Jacobian 
of /, m with respect to a and b we get 



Thus dSdQ, where */Q is the element of solid angle, is equal to (3.50). 

Mean length of secants of a convex body 

3.40 Now consider the mean length of secants of a convex body. Let 
K be the body and K(ri) its two-dimensional projection in the direction 
n. The integral of the length of a secant in the direction n taken over 
all its points of intersection in K(n) is clearly the volume of K, and the 
mean length is FP(n)- 1 , where P(n) is the area of K(ri). Weighting 
this by -P(n) and integrating over all directions, we get V. But the 
measure of all secants is the integral of P(n) over all directions, which 
is JS. Thus the mean length of a secant is 4FS" 1 . 

3.41 This result has some application in architectural acoustics as a 
measure of the mean free path in a room (Bate and Pillow, 1947). 
Suppose the wavelength to be so small that the sound-waves can be 
considered to travel in straight lines until reaching the walls, when they 
are reflected in the usual manner. Let P be the point inside the room 
which is emitting sound-waves uniformly in all directions. Taking one 
of these directions, we may suppose the path of the sound-wave to ex- 
tend for a long distance which, when divided by the number of reflec- 
tions, will give the mean free path in that direction from the point P. 
Averaging over all directions we obtain the mean free path from the 
point P. In general this will depend on the point P. Bate and Pillow 
show that for a room in the form of a rectangular parallelepiped the mean 
free path is independent of P and equal to 4VS~ 1 9 the mean length of 
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a secant. For a sphere of radius a with the point P at a distance b from 
the centre the mean free path is 

n^}" 1 . (3.58) 

a-b) 

Averaged over a uniform distribution in the sphere this is again 4-VS* 1 . 
For a cylinder the mean free path also depends on P, but averaged over 
all points the result is 4VS~ l . Thus it appears to be a general rule that 
the mean free path obtained by repeated reflection is equal to the length 
of a random secant of the body, but this has not been proved in general. 

3.42 Buffon's problem can also be generalized by considering a 
"needle" of length L placed at random in space in which there are an 
infinite number of parallel planes unit distance apart. Suppose first that 
L < 1 . Using polar coordinates for the direction of the needle, the axis 
of coordinates being perpendicular to the planes, the probability element 
is 

(4rc)- 1 sin0<#<ty (3.59) 

in the usual notation. Integrating over <, this can be written |J(cosO). 
Hence the projection of the needle on a line perpendicular to the planes 
is an interval of length /, where / has a uniform distribution on the in- 
terval (0, L). Since the position of this interval is random, the prob- 
ability of an intersection for any given orientation is / which, averaged 
over the distribution of orientations, gives \L for the required prob- 
ability. Similarly the distribution of the number of intersections for 
L > I is easy to find, and in any case it is obvious that the expected 
number of intersections is \L. 

3.43 As for the case ot random lines in a plane intersecting convex 
figures, it is possible to construct a theory of the intersection of random 
lines in three dimensions with convex regions which enables inferences 
about these regions to be drawn from knowledge of the intersections. 
This is more conveniently studied in the next chapter, in connection 
with the theory of intersections of random planes with convex figures. 
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CHAPTER 4 

RANDOM PLANES AND RANDOM ROTATIONS 

Random planes 

4.1 We have already seen in Chapter 1 that the appropriate element 
of measure for planes defined by the polar equation 

x sin 6 cos (f> +y sin 6 sin </> + z cos 6 = p 
is 

smOdOd<t>dp, (4.1) 

and that if we use the representation 

ux -f vy 4- wz -f 1 = 0, 
the corresponding element of measure is 

dudvdw 

' ^ * ' 



From these results we can obtain the measures of the sets of planes 
which satisfy any specified conditions. Thus consider the set of all 
planes which intersect a linear segment of length L. Take this segment 
along the #-axis and use polar coordinates. Then p will be uniformly 
distributed between and L cos 0. The measure of the set of planes will 
thus be 



r2n p2 rLcosO 

d</> sin<9 dpdO = nL. (4.3) 

Jo Jo Jo 

This can be generalized to a twisted curve of length L. Let 
N(0,<f> y p) be the number of intersections of a plane of coordinates 
(0,<f>,p) with the curve. Then by breaking up the curve into a set of 
nearly linear elements and adding we see that 



m 



N(0, 0, p) dd d<f> dp = nL. (4.4) 



4.2 (4.4) can be given another interpretation. Consider the projection 
of the curve on a plane perpendicular to the vector direction n and sup- 
pose this has length L (n). Let L be the average of L (n) over all direc- 
tions so that 
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where dw is the element of solid angle. Then by splitting up the curve 
into small parts which are approximately linear we see that 

L = 2n-*L. (4.5) 

This result can be obtained from the previous one. For consider the 
curve projected on the plane perpendicular to n. This has length L(n) 
which by (3.4) is equal to \n multiplied by the average number of inter- 
sections of the twisted curve with random planes perpendicular to the 
plane of projection. If we now integrate over all directions of n we see 
that 



L = 



(47*)'- 1 (L(n)dw = 



because in integrating over all directions of n we have counted each 
plane in space twice, thus providing a factor 2n. 

4.3 This result is similar to Steinhaus* result (Chapter 3) and it is 
curious that (as pointed out by Santalo, 1946) it is possible to prove 
results similar to (3.53) for the average length of the projections on the 
face of a dodecahedron or icosahedron. Let L t and L 2 be the mean 
projections in these two cases. Then, as in (3.53), 

0-91758 ^ L^L)- 1 ^ 1-07869, 

0-9560 ^ L 2 (L)~ l < 1-0472. (4.6) 

To see this it is first clearly sufficient to prove these inequalities when 
the curve is a straight line. This we can replace by a thin cylinder of 
diameter d, where d is small. Except for a small error term, the areal 
projections and surface area of this cylinder are proportional to the 
length, and by using the previous theorem the result follows. However, 
in this case the inequalities (4.6) are not the best possible and the best 
inequalities have not been established. 

Planes intersecting a curve 

4.4 Now consider the measure of the set of all planes which intersect 
a closed convex curve C lying in a fixed plane. Since every plane which 
intersects this curve must do so in two points, we see from (4.4) that 
the measure of all such planes is \nL, where L is the perimeter of C. 

4.5 Before extending this to the measure of planes intersecting convex 
figures in general, we describe another result, due to Barbier (1860), 
which generalizes more simply. Instead of obtaining the measure of all 
planes which intersect the plane convex figures, we obtain the integral, 
over the set of all such planes, of the length of the intersection. Suppose 
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that such a plane is given by 

xs'md cos(f>+ys'md sin^-f^cosfl = p 
and that the fixed plane is z = 0. The line of intersection is given by 

z 0, x cos (/>-}- y sm<f) = -r 



sin0' 
Let C be the length of the chord formed by this line, so that 

_ = S, the area of the convex figure. 
J sinO 5 

Then the total integral over all intersecting planes is 



( (( Csm6dpdOd<l> = r*ty f*si 

= 2nS [ 
J 



f 



fa*S. (4.7) 

o 

This approach ascribes to each plane a weight equal to the length of 
its intersection with the plane convex figure. If we have any reasonably 
regular surface in space with finite area, and L (O y cf), p) is the length of 
the curve of intersection with any plane, the integral of L over the set 
of all intersecting planes is equal to the surface area multiplied by \ri*. 

Planes intersecting a convex region 

4.6 Now consider the measure of all planes meeting a convex region K 
in space. Keeping 6 and </> fixed, the integral is equal to the length of 
the projection of the figure on a line in the direction (fl, 0). This length 
may be called the "thickness" (e'paisseur, Breite) of the figure in the 
direction (#,<). The measure of all planes meeting K is therefore 

J = | f f T(6,<f>) sinddddfr (4.8) 

where T(0,(f>) is the thickness in the direction (#,</) and the integral is 
taken over all directions. If K is replaced by a convex figure in a plane 
we have seen that J is equal to |jrL, where L is the perimeter of the 
convex figure. By analogy with this case E. Cartan (1896) proposed that 
2n~ l J be called the "perimeter" of the convex figure K. Minkowski 
(1903) has shown (and we prove below) that 

VS, (4.9) 

where K is sufficiently regular for the two principal radii of curvature 
to exist and be integrable, dS is the element of surface area, and the 
integral is taken over the whole surface. Having found M it then follows 
from the above that the mean length of the perimeter of the intersection 
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of K by an arbitrary plane is 



and since the volume, F, of K is obtained by integrating the area of the 
intersection over all possible planes and multiplying by (2ft)- 1 , the 
mean area of intersection is 

2nVM~\ (4.10) 

Minkowski's theorem 

4.7 We now prove (4.9) directly by the method given by Deltheil. 
Taking the origin of polar coordinates inside K and writing 

a = sin0 cos<, ft = sinfl sin<, y = cos0 

for the direction cosines of a vector issuing from the origin, we see that 
the tangent plane, or plane of support, is given by 

au + fy + yz^Hfrfi, (4.11) 

where H(0 y (/)) is the function of support, so that J is the integral of H 
over all directions. Suppose that the tangent plane always touches the 
surface at a single point M (when this is not true the result can be 
obtained by a limiting process). The coordinates of M can be obtained 
in terms of 9 and < by solving (4.11), and the two equations obtained 
by differentiating (4.11) with respect to 6 and </>. We get in this way 

rr - n j dH a A dH 

x = H sin cos d) -f cos cos 



-, 

ov d(/> sine/ 

. . . 

y = 



- 

OU Off) 

z = HcosO-^-sinO. (4.12) 

ou 

The form of these equations shows that the line from the origin per- 
pendicular to the plane of support can be regarded as having com- 
ponents 

dH . dH 

H ' w t"* 9 )- 1 * 

relative to orthogonal coordinate axes whose direction cosines are given 
by 

(a,/9,y) = (sin cos <^, sin sin <^, cosfl), 

(a',/?',/) = (cos cos ^, cos0sin<, -sin 6), 
(a",r,/') = (-sin^,cos^,0). (4.13) 

At the tangent point M there will be two lines of curvature each with 
its own principal radius of curvature. Let p be one of these. By a 
G.P. 
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standard formula in differential geometry it follows that for a displace- 
ment along the corresponding line of curvature we have 

dx dy dz 
da^Tp^dj^ ~ P * 
Hence along this line of curvature <f> will be a function of such that 

dx da. dy d/3 dz dy 



= = 

dx doi~ dy dp ~~ dz dy ( 4 ' 14 ) 



r\ - __ _ Q - 

d<p d(f> d(p d(p d(f> 

The vector (a, /?, y) is in the direction of the perpendicular on the 
plane of support and is therefore at right angles to the infinitesimal dis- 
placements of P when and cf> vary. Hence 

dx n dy dz 

- 



Differentiating these with respect to 6 and <j> using (4.13) we get 



d 2 x 

say, 



= 2a 



= Sa +Tsin 2 (9, say. (4.16) 

Multiplying the last three ratios in (4.14) by a, ft, y (or a', /?', 7') in 
the numerator and denominator and adding, we therefore get 

R-p _ S^ 
S sin 6 " (r-/)~)sin~0 
from which we obtain the quadratic equation 

p*-(R+T) P + RT-S 2 = 0. (4.17) 

To find R, T, and 5 in terms of 6 and <f> we differentiate (4.11) with 
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respect to 6 keeping x, y, and z fixed, and then again with respect to 
allowing x, y, and z to vary, obtaining 

'*'% = +w- < 4 - I8 > 

Differentiating with respect to </> and allowing x, y, and z to vary, we get 

d*H dH 

S = (smd)-i~^-cosO(sme)-*. (4.19) 

Differentiating (4.10) with respect to (/> we get 

Sa"* = ( 8 in0)- 1 ~ (4.20) 

O(p 

and differentiating again, letting #, y, and z vary, we get 



and therefore 



From this 



TsinO-xcos<l>-ysm<f> = (sine)- 1 -, (4.21) 

v<fr 



7)J-f TfiJ-f 

T = ,+ 



{^rr ffilJ ^f-J^\ 

H+ k gg cotfl + i _+ 4( 8 infl)--^|. (4.22) 

The lines of curvature are orthogonal, and if we represent an element 
of area on the surface around the point M by da we have 

da = ( PlP j-i<Kl, 

where dQ is the corresponding element of solid angle traced out by the 
vector in the direction (6, <j>). Minkowski's theorem then takes the 
form 



fij-f ffiPf ffij-f} 

H+^cote+l w + l(sme)-*^ n ddOd<f>, (4.23) 
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where ^ < ?r, < </> ^ 2jt. To prove the result it is therefore 
sufficient to show that 

A = J J (sin0)-i W = 0, (4.24) 



and B= cos0+8in0-<0<ty = 0. (4.25) 

2/T 

v4 is zero, since after integrating with respect to <f> we get ----- which is 

0(f) 

periodic. In B we integrate first with respect to and obtain 

sinfl^, (4.26) 

which is zero at = 0, TT. Thus Minkowski's formula is demonstrated. 

4.8 It follows that if K l is a convex body interior to K, and M K ^ and 
M K the corresponding integrals (4.10), the probability that a random 
plane intersecting K also intersects K l is 

M Ki Mji l . (4.27) 

This is the spatial analogue of the similar result for plane figures implied 
by formula (3.2). 

Some further results 

4.9 We can obtain a number of other means and probabilities in this 
way (Hostinsky, 1925). For example, we have seen from (4.7) that if L 
is the length of the perimeter of the intersection of a plane with K, the 
integral of L over all intersecting planes is %n 2 S. Hence the mean 
perimeter is \n 2 SM~ l . Similarly, since the integral of the area of inter- 
section over all intersecting planes is 2n V, where V is the volume of K, 
the mean area of intersection is 2n VM~ l . 

4.10 Now consider two independently random planes both intersecting 
K. The measure of all such planes (counting them as distinguishable) 
is M 2 . To obtain the probability that their line of intersection inter- 
sects K we have to find the measure of all such pairs satisfying this 
property. Suppose one of them is fixed and has an intersection with K 
whose perimeter is L. Then the measure of planes which intersect this 
intersection is \nL from (4.3). On the other hand we have seen that 
the integral of L over all intersecting planes is \rP S, so that the measure 
of all pairs of planes intersecting K and whose intersection intersects K 
is Jtt 3 S. Thus the probability of two planes which intersect K having 
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an intersection which intersects K is 



4M- 



4.11 Now consider the probability that three planes intersecting K 
have their common point inside K. The measure of all triples of inter- 
secting planes is M 3 (regarding the planes as distinguishable, so that 
each geometric triple is counted six times). Suppose two of these planes 
intersect inside K. The measure of all planes which intersect this inter- 
section is n C, where C is the length of the intersection. The integral of 
C over all positions of one of these planes is |jc 2 5, where S is the area 
of intersection of the other plane. In turn, the integral of S is 2n V. 
Hence the measure of all such triples is n* V> and the required prob- 
ability is 



4.12 We have already seen that the mean value of the length of the 
intersection made by a random line intersecting K is 4VS~ l . Let such 
a chord have length C. Then it is possible to find some of the higher 
moments of C. Hostinsky (1925) has obtained the fourth moment. 
Consider two points A l and A 2 inside K and with coordinates (x ly y ly zj 
and x 2J29 z 2 . Then 

^dy.dz^dx^dy^dz^ (4.30) 

Let A l be fixed for the moment and let a line through it form a chord 
of length C, A being at a distance r from one end of the chord and 
C r from the other. Suppose A l is the vertex of a cone surrounding 
the chord and having a small solid angle dl. Integrating with respect 
to (# 2 , y 2y # 2 ) over the volume of this cone, we obtain 

Hr + (C-r)}fiJ. (4.31) 

We can choose the coordinates of A in such a way that one cor- 
responds to the direction of the chord and the other two are perpen- 
dicular and form an element of area dQ. Then 



(4>32) 
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From the previous results we see that 

JJJJ 40^ = ^5, (4.33) 

and hence the mean value of C 4 is 

\2V*(nS)~*. (4.34) 

4.13 Hostinsky considers some other means of a similar kind and also 
the analogue in three dimensions of Sylvester's quadrilateral problem, 
i.e. given five points chosen at random inside a convex body K, to 
obtain the probability that none lies inside the tetrahedron formed by 
the other four. 

Distribution of sizes of particles 

4.14 We now consider the problem of determining the distribution of 
the sizes of particles embedded in an opaque medium from the measure- 
ment of the figures formed by their intersections with a random plane, 
or from the segments formed by their intersection with a random line. 
This problem has application in a number of scientific fields, and has 
been considered by a number of authors, some of them independently, 
e.g. Wicksell (1925, 1926), Scheil (1931), Pullman (1953), Reid (1955), 
Santalo (1955), and Krumbein and Pettijohn (1938). 

4.15 Suppose first that the particles are all spherical and distributed 
in such a way that the mean number of centres in unit volume is A. We 
can treat the centres as being distributed in a Poisson field, although 
the spheres are non-overlapping since the plane of intersection is itself 
chosen at random. Let the probability distribution of the diameters 
have density F(r), so that (r being used for a diameter) 



is the probability of a sphere chosen at random having a diameter less 
than a. The expected number of spheres whose diameters lie in the 
range (r, r +dr) and which intersect an arbitrary plane in a unit area of 
the latter will be ArF(r)dr y and hence the probability density of the 
distribution of diameters of spheres which intersect the plane will be 

f(r\ rji \ r ) _ r ** v) /A ?c\ 

J\T) - , JT-.J3; 

rFMdr 

Jo 

where r is the mean diameter of a sphere chosen at random. 

If a sphere of diameter r does intersect the plane, the probability 
that its centre lies at a distance y from the plane is 2r~ l dy (0<y< |r), 
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and the diameter x of the circle of intersection is (r 2 4y 2 ). Hence the 
expected number of spheres having diameters in the range (r, r +dr) and 
intersecting the plane in a circle of diameter x whose centre lies in a unit 
area is 

A^ 1 F(r) x (r 2 - * 2 ) -* dr dx. (4.36) 

The probability distribution, (/> (r) dr, of the diameters of the circle 
of intersection when it is known that the sphere does intersect the plane 
is therefore 

(4 ' 37) 

Since <f>(x) can be found by observation, we have to determine F(r) by 
the integral equation 



Write F(r) = rF^r 2 ) and we obtain 



which is an integral equation of AbePstype (Courantand Hilbert, 1931, 
p. 134) whose solution is 

r \d 

\(z) = 2n~ l (x 2 z)-*r ---(x- l <f>(x))xdx y (4.40) 



F 

from which we obtain 



Given observations of the diameters of the circles we can estimate 
the distribution (x) and hence calculate F(x) from (4.41). For numeri- 
cal work it is probably best to integrate (4.41) by parts first, so as to 
avoid having to estimate a derivative. 

4.16 F(r) and <(#) are both the densities of probability distributions, 
and it is interesting to express the moments of one distribution in terms 
of those of the other. Let M n and m n be these moments, so that 

M n = |"W(r)<fr, m n = r^<j>(x)d x . (4.42) 

Jo Jo 

Using (4.37) we have 

m n = r x n <f>(x)dx = rjT 1 { ^ n + 1 f ~ (r*-&)-*F(r)drdx, (4.43) 

JO JO J x 



88 

and inverting the order of integration (which can be justified) we get 

m n = r^ 1 ["r" + iF(r)dr { * w n + l (l-w*)-* dzv. (4.44) 
Jo Jo 

The second integral is of the form 

2 ' 4 ' 6 - ...- 



C** n 

sm w 

Jo 



T - jj 

, if wis odd, 
3 . 5 .... w 

1 .3. ... n In ... 
OA -o> if " is even, 

2.4. ... 2 



= /*> say- 
Then 

= ri-V.+i M.+I = /.ft M- 1 M n+1 , (4-45) 

since r = M x . This provides us with another method of determining 
the distribution, since if we can estimate m ly m 2 , ... from the observa- 
tions, we can find MjjMf 1 , MgMf 1 , ... and there remains only the 
problem of finding M v r . One way of doing this is to use the fact 
that r , the mean of the diameters of the spheres, is equal to \K times 
the harmonic mean of the observed diameters. To prove this we have 
r {harmonic mean of observed diameters} 



= r dr {' F(r)(r*-x*)-*dx = \n \* F(r)dr = \n. 
J o J o Jo 

It is also interesting to note that it is possible for the distributions of true 
diameter and observed diameter to coincide, i.e. for <j>(x] F(x). This 
occurs when 

F( X ) = ^xp-~, (4.46) 

for then 

r = cr(|jr)% 

and by (4. 19) 



Equation (4.38) also occurs in the theory of the "globular cluster prob- 
lem" in astronomy (Wicksell, 1925). A globular cluster is a collection 
of stars arranged around a common centre in spherical layers of equal 
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density. The relationship between the density in these layers as a func- 
tion of the radial distance and the density in a circular ring in the projec- 
tion on a plane is again given by an integral equation of the form (4.38). 

4.17 We also see that it is possible to set up an analogous theory for 
circles in a plane cut by random straight lines, and this links up with 
the investigations of Mclntyre described in Chapter 3. The resulting 
integral equation (which is obtained by Reid, 1955) is again of Abel's 
type and can be solved by similar methods. 

4.18 Wicksell (1926) has considered the case of ellipsoidal particles. 
Suppose these are all of the same size and same shape, and that one of 
them is cut by some plane. The intersection is an ellipse, and as the 
' 'diameter" of this ellipse we take x = \/(f if 2)* where f x and f 2 are the 
major and minor axes of the ellipse. If the major and minor axes of 
the elliptical section through the centre of the body are cr 1 a 2 we simi- 
larly define p = V( a i^z) as the "diameter" of this section. Then if y is 
twice the distance of the sectional plane from the centre of the body, and 
h is the distance between the two tangent planes parallel to the section, 
we have 

y _ V(P*-**) 
h~ p ' 

and hence we can write as before 

<4 ' 47) 



where </>(x) is the probability distribution of the "diameters" of the 
observed sections and /(/>) is the probability distribution of the dia- 
meters p of the central sections of those particles which do intersect the 
plane, these values p being measured in the plane parallel to the section 
through the centre of the particle. The main problem is to relate f(p) 
to the distribution of the size and shape of the particles. This is difficult 
since the distribution of p is not that of the diameter of the ellipse formed 
by a random plane through the centre of the particle unless the latter is 
spherical. This may be best seen by considering ellipsoids which have 
one axis much longer than the other two, and which are therefore more 
likely to be intersected by planes perpendicular to the direction of this 
axis. As a consequence the resulting analysis carried out by Wicksell is 
rather complicated. 

Thin sections 

4.19 Krumbein (1935) (Krumbein and Pettijohn, 1938) applied the 
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above theory to the analysis of thin sections of sediments in petro- 
graphy, assuming the inclusions to be spheres. His original theory 
neglected the fact that larger spheres have a larger probability of being 
included in the section. He obtained a closer fit to observed values with 
the simpler and incorrect theory when examining two samples of sand 
both by this cross-section method and by actual measurement of the 
particle size after separation. The reasons for this are not clear and may 
be connected with the fact that the particles are not in fact spherical. 
Later work by Greenman (195 la, 1951b), Rosenfeld, Jacobson and 
Perm (1953), and Packham (1955), also uses the incorrect equation but 
obtains good fits to size distribution obtained by sieving. 

4.20 If the assumption of sphericity is dropped, we may consider just 
what can and cannot be deduced by a plane section. Suppose that on 
the average the number of included bodies in unit volume is A, that their 
average volume is v, and that their average surface area is a. Then 
without further assumptions it does not seem possible to estimate A. 
Let A v , A py and L p be the average number of intersections with a random 
plane per unit area, their average area, and their average perimeter, and 
similarly let A t , L t be the average number of intersections per unit length 
of a random line, and their average length. These five quantities can 
be estimated as accurately as we wish by repeated sampling. Clearly, 

I, A, = ^L, = to, (4.48) 

so that from either plane or line intersection we can estimate the pro- 
portion of the volume occupied by particles. A^ L P will be the average 
sum of the perimeters of all intersections per unit area, and by (3.3) this 
will equal 2n~ l A t multiples of unit length. On the other hand, 4A t units 
of area will equal the average total surface area of all particles in unit 
volume by Cauchy's formula derived from (3.52). Thus the total surface 
area of the particles can be found. 

4.21 Although it really belongs to the previous chapter, we may con- 
veniently consider here the problem of the distribution of the intervals 
formed by the intersection of random lines with convex figures em- 
bedded in some volume. We consider only the case where these figures 
are spherical and such that their diameters have a distribution with the 
probability density F(r). Let / be the length of an intersection and 
g (/) the probability distribution of such lengths. The probability of a 
sphere of radius r intersecting a random line is proportional to Jrcr 2 , and 
hence if/! (r) is the probability density of the distribution of r for those 
spheres which do intersect the line, 
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r*F(r)dr 

J o 

= r*F(r)/r 2 , (4.49) 

where r 2 is the second moment about the origin of the distribution 
of r in a randomly chosen sphere. Let x be the distance of the centre of 
the sphere from the line. Then the distribution of x, given that an 
intersection occurs, is 

8xr~*dx (0 < x < |r). (4.50) 

For a given value of x, the length of the intersection is 



and hence the distribution of / is 

2r~*ldl (0 < / < r). (4.51) 

The overall distribution of / then has probability density F(r) where 



= 2M7 1 (~ F(r)dr. (4.52) 

The solution of this equation is immediately 

< 4 - 53 ' 



r 2 is probably best obtained by estimating the limit of 21 g (I) ~ l as / tends 
to zero, since this converges to r 2 . As in the previous theory, it is also 
possible to set up equations connecting the moments of the distribution 
given by g(l) and that given by F(r). Attempts have been made to 
generalize (4.52) to cases where the bodies are not spherical, but these 
do not seem satisfactory. 

Figures in three-dimensional space 

4.22 Another set of problems closely related to these have been con- 
sidered, with plane figures and finite linear segments distributed at ran- 
dom in space, and also projections of linear segments on a fixed plane. 
Suppose first that circular disks of diameters r are distributed at 
random in space and are cut by a random plane in intersects of length x. 
Let F(r) be the probability density of the distribution of the r's, and 
similarly <f> (x) the probability density of the distribution of the lengths x. 
The probability that a given disk is cut by the plane is clearly propor- 
tional to r, and hence if /(r) is the probability density of the distribution 
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of diameters of the disks which do intersect the plane, we have 

/ - r ;^ 

rF(r)dr 

= rr^F(r), (4.54) 

where r is the mean diameter of a disk chosen at random. If the plane 
cuts a disk of diameter r at a distance y from its centre, the distribution 
of y is 2r~ l dy (0 ^ y ^ r), and the distribution of x, the length of 
the intersect, is therefore 

xdx 

rV(r 2 -* 2 )' 
We thus obtain 



which is the same equation as (4.37) and may be treated by the same 
methods. 

4.23 Now consider the projections of segments and plane figures. 
Suppose that linear segments of lengths / are distributed in space with 
random directions and are such that each length / has a probability dis- 
tribution with density F(l). Let x be the length of the projection of the 
segment on a random plane. 0, the angle between the segment and the 
normal to the plane, will have a distribution J sinfl dO (0 < 6 ^ n). Given 
/, the probability distribution of the projection x = IsinO will therefore 
be 

xI~*(P-x*)-*dx (0 ^ * < /) 

and <f>(x), the density of the probability distribution of the projections x, 
will therefore be given by 

This is of the same form as (4.47), the equation used by Krumbein 
in his approximation. (4.56) can be reduced to an integral equation of 
Abel's type, and its solution verified to be 

- 



4.24 If we have plane figures of areas a distributed with random 
orientations in space, and such that a is distributed with probability 
density F(d) y we consider the areas, a say, of their projections on a fixed 
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plane. Suppose that the oc's have a distribution with probability density 
0(<x). We have a = a|cos0|, and |cos0| is uniformly distributed on 
the range ^ | cosO | ^ 1, so that we obtain 

"-("Ifa (4.58) 

a d 

whose solution is obviously 

F(*) = af (a). (4.59) 

Random rotations 

4.25 We now consider random rotations in Euclidean space. In two 
dimensions such a rotation is completely described by an angle through 
which the body or figure is rotated. The only natural criterion for geo- 
metric randomness is the condition that any measurable set of values 
of should have a probability which is invariant under any rotation. 
It is then obvious that the only probability measure which satisfies this 
condition is such that the probability of any set of 0's is equal to their 
Lebesgue measure divided by 2n. 

4.26 In three dimensions the situation is much more complicated. We 
can describe such a rotation either by giving the elements of the ortho- 
gonal matrix corresponding to the rotation or by giving the direction 
cosines (a, /?, y, say) of the axis of rotation, together with the angle 
(V y say) through which the body is rotated. In either case there are 
three effectively independent coordinates, and the space of all rotations 
is three-dimensional. 

Once again the natural condition to impose on the probability meas- 
ure is that it is invariant under any arbitrary rotation. If this is to be 
true it is natural to assume that these conditions will be satisfied if all 
directions of the axis of rotation are equally probable (in the sense used 
in previous chapters) and that the angle of rotation must be uniformly 
distributed on the range (0, 2n). The first of these conclusions is clearly 
a necessary consequence of the invariance condition, but the second is 
false. Following Deltheil (1926), we shall now determine the correct 
distribution of V by using infinitesimal transformations. 

4.27 Take rectangular axes of coordinates through the fixed point of 
the body whose initial position we describe by 5 . We define a "rota- 
tion vector " corresponding to the rotation, as a vector whose direction 
cosines are (a, /?, y) the direction cosines of the axis of rotation and 
whose length is tan \V, where V is the angle of rotation. The com- 
ponents of this vector are 



(4.60) 
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/= a tan |F, 
m = /Stan fF, 
# = 7 tan | F, 

and the elementary probability that we have to find is 

F(l,m,n)dldmdn. 

It is convenient to transform (l,m,ri) into homogeneous coordinates 
which are defined as follows: 

A = a sin \ F, 
// = /?sin JF, 
v = y sin | F, 
p = cosF. (4.61) 

Now suppose that the position S l is derived from S by the trans- 
formation (A!,/*!, 1^,/Di), the position 5 2 from S l by (A 2 ,/* 2 , v a , p 2 ), and 
5 2 from 5 by (A 3 ,/^ 3 , i> 3 , /> 3 ). Call these rotations R ly R 2 and jR 3 
respectively. Our first aim is to express (^3,^3, v 3y /o 3 ) in terms of 
(AI,/MI, ^1, PI) and (A 2 ,// 2 , ^ 2 , /3 2 )- To do this we consider first the special 
case where R 1 and R 2 are rotations through angles n. Such a rotation is 
known as a "symmetry" (Fig. 4.1). 




Fig. 4.1 

Let (!, b lt x) and (a a , 6 2 , c 2 ) be the direction cosines of the axes of 
R! and R 2 . The resultant rotation /? 3 will have an axis perpendicular to 
both O/?! and OR 2 . Consider an initial point A which is transformed 
into A l by R 19 and let A be transformed into A 2 by R 2 . We can take A 
in the plane R 1 OR 29 and then the total angle through which it is 
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rotated is A OA 29 which we denote by V. Then \V = angle I^OI^ 
where / x and 7 2 are the perpendiculars from A and ^ on the axes 
OR 1 and OR 2 . From the definition above we have 

cos^V = a l a 2 -\-b l b 2 +c 1 c 2 

Since O/? 3 is perpendicular to O/?! and O/? 2 , its direction cosines 
are proportional to 



M. -*&i, (4.62) 

which are the components of the vector product of O/ 2 into O/!. The 
magnitude of this vector product is sin \V since \V\& the angle between 
O/! and O/ 2 , and hence 



"3 = fli6 2 -02*i (4.63) 

and we also have /> 3 = a l a 2 +b l b 2 J rc l c 2 . (4.64) 

This result holds only for a rotation through an angle n. Now con- 
sider a rotation of arbitrary amount. Referring again to Fig. 4.1 and 
relaxing the condition that A Q lies in the plane through O perpendicular 
to Ol?3, we see that any rotation about an axis OR 3 can be replaced by 
the product of two rotations through n about two axes, OSj and OS 2 
say, perpendicular to O7? 3 . Moreover, this can be done in an infinity of 
ways, since we can take for OS l any axis perpendicular to OR, OS 2 being 
then taken at an angle \V to OS ly where V is the required angle of 
rotation. 

Returning to the general case where the rotation R 3 (A 3 ,// 3 , v 3y p 3 ) 
is the result of applying /? x = (Aj, // x , v l9 p x ) first and then 
RZ = (A>2iJ Ll ' 2 ,v 2 ,p 2 ), we decompose R l into two symmetries T x and T[, 
and similarly R 2 into T 2 and T! 2y so that /? 3 is the result of applying first 
7\, then T[ y then T" 2 , and finally T'%. From the arbitrariness of the de- 
composition of a rotation into symmetries we can choose T[ and T 2 to 
be both rotations about the common perpendicular to R l and R 2 . Their 
joint effect is therefore to leave the position of the solid unchanged so 
that R 3 is the result of applying first T and then T%. 

Let (a l9 b l9 c^ be the direction cosines of 7\, (a, b, c) those of T[ and 
T 2 , and (a 2y b 2y c 2 ) those of J 1 ^. Then R t has the homogeneous co- 
ordinates 



Pi = ^i +bb 1 +cc lt (4.65) 
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R 2 has the coordinates 



= bc 2 -b 2 c, 



v 2 = ab 2 -a 2 b y 

p 2 = aa 2 +bb 2 +cc 2 . (4.66) 



has the coordinates 



^3 = a 1 b 2 -a 2 b ly 



and we want to express the latter in terms of the former two. Using 
(4.65), we solve for (a ly b ly cj in terms of (A^/^, v ly p^ and (a, b, c); and 
in (4.66) we solve for (a 2y b 2 ,c 2 ) in terms of (A 2 ,// 2 >^2> ^2) 
thus obtaining 



2 = ap 2 -bv 2 +cju, 2y 
b 2 = av 2 +bp 2 -ch 2y 
c 2 = -0// 2 +iA 2 +r/> 2 . (4.68) 

We also have identically 

d l = a^ l -{-b/i l -}-cr l = 0, 

d 2 = aA 2 +bju, 2 +cr 2 = 0. (4.69) 

Inserting (4.68) in (4.67) and using (4.69), we now have 
A 3 = b 1 c 2 cb 2 +a l d 2 +d 1 a 2 , 



= C l a 2 -a l c 2 +b l d 2 +d l b 2 , 



(4.70) 

These are the required equations giving the composition of two 
rotations. We now return to the original coordinates which we take as 
(li> m i>iii) f r -Ki> (^2> W 2> W 2) f r ^2> an d (/s, ^3, w 3 ) for the resultant of 
both. We therefore get 
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+m 2 +#2 /i #1/2 



_ 

\ Iil 2 m l m 2 

From this we derive the equations for an infinitesimal transforma- 
tion by considering the principal parts of (/ 3 l 2 ,m a m 2 ,n B n 2 ) when 
/! Wi, HI tend to zero. In this way we find, writing (/, m, ri) for the com- 
mon value of (/ 2 , m 2 , n 2 ) and (/ 3 , w 3 , n 3 ), 



dm = (m/H-n)/!+(l +m 2 )m l +(mn-l)n 1 , 
dn = (In-m)l 1 +(mn+l)m 1 +(l +n*)n l , 

where l^m ly n l are small. 

We now follow the method of argument used in Chapter 1 and 
obtain the following equations for the density element F(l,m,ri): 

dF dF dF 

= 0, 



dF dF dF 

(Im - n) " 7 +(1 -f m 2 ) +(mn +/) +4mF = 0, 
uL om on 

dF dF dF 

(ln+m)+(mn-l)-- +(1 -fw 2 ) +4nF = 0. 
oL om on 

Multiplying these by /, n, and m respectively and adding we get 

dF 
(1 +/ 2 +m 2 +n 2 ) -f-4//^ = (4.72) 

<7/ 

and there are two similar equations from which we obtain 
dF 4 (Idl+mdm +ndn) _ 



and hence F is a multiple of 

(l+/ 2 +w 2 +w 2 )- 2 . (4.74) 

We now transform into polar coordinates by the equations 

/ = tan ^FsinQ cos<, 

m = tan ^Fsin0 sin^, 

n = tan ^F cos 0. (4.75) 

Then 



and 

dldmdn = $sm*V(cos%V)-*smOdVdOd<f> 
so that the element in the parameter space must be 

dj = %sin*%VsmOdVd6d<f>. (4.76) 



G.P. 
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The range of variation is ^ V < 2n, < 6 ^ n y < ^ 2n t so that 
to convert this measure into a probability measure we must multiply 
by i?r~ 2 , obtaining 



dP = 



sn 



sm6 dVd6dcf>. (4.77) 

Our original guess was that we should choose the axis of rotation at 
random, and then the angle of rotation uniformly in the range (0, 2n). 
This would give 

dP = fa- 2 $inOdVd6d<t>, 
which is therefore wrong. 



4.28 The other approach to this problem suggested that an orthogonal 
transformation be chosen at random by choosing three independent 

random vectors and orthogonalizing 
them. Thus we could take a random 
unit vector x 1 by taking its direction 
cosines proportional to three inde- 
pendent normal variates. Let x 2 be 
another such random unit vector, 
and y a unit vector parallel to the 
component of x 2 perpendicular to x x , 
which is x 2 X 1 (x 1 .x 2 ). Then the 
third vector, z, is defined to be the 
vector product, x x x y, and (x 1? y, z) 
can be taken as the unit vectors in a 
new set of coordinates, and their re- 
presentation in terms of the unit 
vectors of the original coordinate 
system provides the required ortho- 
gonal matrix. That this provides the 
correct orthogonal matrix is nearly 
obvious, but we shall verify it in detail. 

To do this we first consider the representation of the original vector 
of rotation in terms of an orthogonal matrix by Euler's transformation. 
Suppose that the rotation was through an angle V about an axis with 
direction cosines (/, m, n). Let O be the origin of coordinates and a the 
unit vector (l t m,ri) along the axis of rotation (Fig. 4.2). If P is any 
point of the body, suppose that it is rotated into a new position P' and 
that OP, OP' are vectors p and p'. PC is perpendicular to a, and M is 
the foot of the perpendicular from P' on PC. Then 

p' = OC +CF = OC +CM +MF. 




99 

Here 

OC = (p.a)a, 
CM = CPcosF 

= (p-(p.a)a)cosF, 

,, / JCF|sin7 
MP' = (axp) 1 -'- , 

|p | sin a 

where a is the angle between OP and OC. Then 

|CP'| = |CP| = |p | sin a, 
so that 

MF - (axp)sinK 
Thus 

p' = pcosF-f(axp)sinF+(p.a)a(l-cos F), 

and representing this transformation by a matrix we get 
cos V + P (1 - cos V) Im (1 - cos V) - n sin V ln(\- cos V) +m sinF 
lm(\ cos V} +wsin V cos V +w?(\ cos V) mn(lcos V) /sin F 
/w(l-cos F)-msinF wm(l-cosF) +/sinK cos F-fw 2 (l-cos F) 

(4.78) 

This gives the matrix representation of the transformation of the 
coordinates of a point by the given rotation. The effect on the co- 
ordinate axis would be given by a matrix of the same form with V 
replaced by V. Notice in particular that the trace of the matrix is 
1 -f2cos V and depends only on V. 

4.29 We now return to the random orthogonal matrix and construct 
it in a slightly different way. Choose random variables and </> which 
are uniformly distributed on ( 1, 1) and ( TT, n) respectively, and write 
for the first row of the matrix 

{, (l-! 2 )*cos0, (!-)* sin#. (4.79) 

These are clearly the coordinates of a unit vector with random direction. 
Now construct unit vectors A and B defined by 



B = (0, sin<, - cosf}. (4.80) 

These are unit vectors which are orthogonal to each other and to 
the first vector. Write 

C = A cos a -f B sin a, 

D= A sin a -fB cos a, 

where a is a random variable uniformly distributed on the interval 
( 7r,7r). Then C and D can be taken as the other axes of the new co- 
ordinate system, and the whole random orthogonal matrix is 
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(!-!)* cos (l-f)sin^ 

(1 I 2 ) 1 cos a f cos ^ cos a +sin<sina f sin < cos a cos $ cos a 
(1 2 )*sina cos<sina +sin^cosa ^sin^sina cos^cosa 

(4.81) 

which we identify with the matrix previously given. The trace is equal 
to 

cos <f> cos a 4-sin <f> sin a -f sin < sin a cos </> cos a 

= {l-cos((+a)}-cos(<+a). (4.82) 

The trace, which we identify with 1 +2 cos V, is therefore distri- 
buted as 

(1 cos ^) cos y (4.83) 

where is uniformly distributed on (1,1) and y is uniformly dis- 
tributed on (0,7r). cos V is therefore distributed as 

sin 2 ^-cos 2 |v>. (4.84) 

If z0 = cos F and we take y as fixed, w is distributed uniformly on 
( 1, cos \p). The probability density of w is therefore proportional to 

f * 
f(w)dw dw (2 sin 2 1 \p)~ l (hp 

J n -cob" 1 w 

= tan I (cos -1 w) dw 

= 2sin 2 |I / ^J 7 (4.85) 

which agrees with the previous result (4.76) when we insist that positive 
and negative values of V have equal probabilities. Thus we have proved 
that the choice of a random rotation is equivalent to choosing a random 
three-dimensional orthogonal matrix with a probability distribution 
which is invariant under orthogonal transformations. This distribution 
is an invariant measure, and since a sample of n quantities from a normal 
distribution with zero mean can be represented in ^-dimensional space 
by a vector having a distribution which is spherically symmetric, it is 
natural that these ideas should find application in statistical theory, 
especially in connection with multivariate analysis. This has been dis- 
cussed in great detail by James (1954), who applies the theory of invari- 
ant measure in the space of all orthogonal n x n matrices to derive many 
of the standard distributions used in multivariate analysis. 

4.30 It would also be possible to prove an analogue of the "Central 
Limit Theorem" in the form that the effect of repeated rotations, each 
specified by a distribution of 0, <, and V, will have a final distribution 
which should converge to the above one. A sufficient condition for this 
would probably be that the joint distribution of 0, <f>, and V must be 
such that at some point (0 , < , V ) there is an open neighbourhood in 
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which the joint probability distribution is differentiable with respect 
to 6, (f>, and V. 

Application to crystallography 

4.31 Another interesting application has been considered by 
Mackenzie and Thomson (1957) in connection with crystallography. 
A cube is rotated by a random rotation of the above type and it is 
desired to find the least angle through which it then needs to be rotated 
to bring it into the same orientation as before, i.e. having its faces 
parallel to the faces of its previous position but not necessarily being in 
the same position. Mackenzie and Thomson studied this problem by 
Monte Carlo methods, but later Handscomb (1958) and Mackenzie 
(1958) both obtained the exact distribution. 
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CHAPTER 5 

PROBLEMS OF COVERAGE 

Coverage on a plane lattice 

5.1 In this chapter we study problems which involve calculating prob- 
abilities that certain fixed geometrical figures or sets, usually in the plane, 
are covered by other figures whose position is in some way random. We 
begin by considering a particular class of problems which arise when we 
are given a plane lattice (which need not be rectangular) of points and 
we wish to study the probability distribution of the number of these 
points which lie in or on some geometrical figure,such as a circle, whose 
position in the plane is random. This is the sort of situation which could 
arise in practice when a circular sampling hoop is thrown at random on 
to a field in which there are plants whose centres are arranged in a 
regular lattice. 

5.2 The lattice might consist of all points in a plane which have integral 
coordinates (square lattice), or might consist of the points formed by the 
vertices of equal equilateral triangles closely packed, or might be of a 
more general kind. For the most part we consider only the square lattice. 
If a figure of any fixed shape, and possibly fixed orientation, is thrown 
at random on to a square lattice, the number of points it covers will be 
a random variable whose mean will be equal to the measure of the figure 
if the lattice squares have unit side. This is intuitively obvious but may 
be proved as in (5.2) below. 

Coverage on a line 

5.3 The evaluation of the variance is a much more difficult problem 
even when the figure has a simple shape such as a circle. To illustrate 
some of the principles involved we begin by considering the much 
simpler problem of the distribution of the number of lattice points in 
a random interval placed on a one-dimensional lattice. We suppose the 
latter to consist of the points 0, 1 2, . . . and the interval to have length 
/ and random position. 

Instead of taking a random interval and fixed lattice we can take a 
fixed interval and a lattice in random position. Suppose the fixed inter- 
val is ( \l y |/) and write 

/W-'- 1 (~iU**i/), 
= elsewhere, 
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and let the lattice consist of the points T, T 1, T 2, ..., where T is a 
random variable uniformly distributed on the interval (0, 1). Then the 
number of lattice points contained in the interval will be 

N(T) = l S f(T+n), (5.1) 

n - oo 

whose mean is clearly 

/[" f(u)du = l (5.2) 

J 00 

The variance of N(T) is 



2f(T+n)f(T+s)dT-l* 

n,s 



f(T)f(T+n)dT-P. (5.3) 



00 00 



The integral in (5.3) is equal to 

I- 2 (I- | n |) if || < / (5.4) 

and is zero otherwise. The variance is therefore 

S (/-|n !)-/. (5.5) 

If I = p +q, where /> is integral and < y < 1, this is equal to 

q-q*. (5.6) 

5.4 We can however look at this problem in another way. The sum 
for N (T) in (5.1), regarded as a function of T, is periodic with period 1 
and can be expanded in a convergent Fourier series 

N(t) = S C m exp(-2nimt), (5.7) 

W == CO 

where 

C m = f X W (0 exp (27TI Iff/) A 
J 

f(t)txp(2nimt)dt 

oo 

= /y'(2n)^(2niin) > (5.8) 

where <^ (w) is the Fourier transform 



ParsevaFs theorem gives 

= S |C M |. (5.10) 
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Since | C | 2 = / 2 we get 

(5.11) 



where the sum is taken over all integral values of m except for m 0. 
Inserting the value of /(/), we have 



1 sin |/w 

-r-y for u 4= 0, 



)-* for M = 0. (5.12) 

Using the above formula, we get 

rxr/^x-, 2 sin 2 w/7i ,,- -*x 

var[JV(T)] = a S--^-, (5.13) 

which is of quite a different form from (5.2) and (5.3). The equality of 
(5.6) and (5.13) is an example of Poisson's formula (Titchmarsh, 1937, 
p. 61). 

Coverage of a square lattice by a rectangle 

5.5 From this it is easy to deduce the result for a rectangle whose posi- 
tion on a lattice is random and whose sides are parallel to the two direc- 
tions of the lattice, since the number of lattice points covered may be 
regarded as the product of two independently distributed random vari- 
ables which are the number of lattice points on a linear lattice covered 
by random intervals corresponding to the sides of the rectangle. The 
variance of the number of covered points will vary in a complicated 
manner. If the sides are /, m, where / = p +#, m = P +Q (0 ^ <?, 
Q < 1), the variance is 



The case where the orientation of the rectangle is also random has 
not been worked out. 

5.6 D. G. Kendall (1948) and R. A. Rankin (Kendall and Rankin, 
1953) have considered the similar but more difficult problem of deter- 
mining the variance of the number of lattice points inside a circle of 
radius R and random position. 

Let the lattice consist of the points (ij) where ij 0, 1, 2, ... 
and write TV for the number of these inside the circle (x -a) 2 +(y-p) 2 
= R 2 . If a, /? = 0, the determination of JV as a function of I? is a classi- 
cal problem in the analytic theory of numbers. It is known that 

N-nR* = O(R +e ) for every e > 0, (5.15) 
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and Hardy has conjectured that 

N-nR* = 0(#*+ 6 ) (5.16) 

for every s > 0. When the circle is supposed to have a random position 
the problem becomes simpler in the sense that the mean square of 
NnR 2 can be shown to be O(R). To prove this we have to evaluate 
the variance of N for fixed R and a, /? uniformly and independently 
distributed on the interval (0, 1). From the above it is clear that 
E(N) = nR 2 . E(N 2 ) can be found in two ways. 

Let A (R, a, /?) be the number of lattice points in a circle of radius R 
with centre (a, /?). This is a doubly periodic function in a and /? and 
hence can be expanded, at least formally, in a double Fourier series so 
that 

A (R, a, 0) ~ S a exp 2*i (ma +nft). (5.17) 

m, n 

Define C(n y v) to be equal to unity or zero, according as the point (u y v) 
lies in the circle u 2 +v 2 ^ R 2 or not. Then 

A(R 9 * 9 fl) = SC(i-a,;-/0, 

u 

where i,j range over all lattice points. Then 

Q mn = I j A (R, a, p) exp - 2ni (ma. +nft) d* dp 

J J 

= X f r C(u,v)e~'" ( "" l+ "" ) dudv 

ij J i-l J j1 

= cos (2nmu) cos (2nnu) du dv 



(5.18) 

when w 2 +w 2 > 0. For m = w = we get Q 00 = nR 2 . We do not need 
the convergence of the double Fourier series but only the analogue of 
Parseval's identity which gives 



o o 

= n 2 R* +R 2 XZ t '(m 2 + a ).-\/i a {2nR(m 2 +n a )*} > (5.19) 
where the double sum S S' is taken over all values of m and n except 
m = n = 0. Let r(/) be the number of representations of the integer / 
as the sum of two squares. We can then write 

E(N 2 ) = n 2 R*+R 2 2 /- 1 r(/)/ 1 2 {2jr7?/ i }. (5.20) 

1=1 

These series are convergent because 
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for z > and some c > 0, and 



for every 6 > 0. Thus the variance of TV is 

var(#) = R* S l- l r(l)J^{2nRft}, (5.21) 

;=i 

and using the above bounds we get 

var(AT) = O(R) 9 (5.22) 

so that the standard error of N will increase not faster than Jf?*, or the 
square root of the scale factor for the circle. This is in sharp contrast 
with the case considered before of a random rectangle with sides parallel 
to the directions of the lattice. Using the asymptotic formula for the 
Bessel function, 

2 



we get 

var(W) = Rn~* X H r (/) cos* [2rc/Z/ - Jrc] +0(1). (5.23) 



This shows that as R > oo, R~ l var(N) does not tend to a limit but 
has upper and lower limits of indetermination which are those of 

\n~* % Hr (/){! 
/--i 

This in turn certainly lies between zero and 



= (0-676497) 2 - 0-457648 (5.24) 
Here C (#) is Riemann's zeta function and 

L(s) - l- 8 -3- 8 +5- 5 -7- 5 +etc. 

5.7 In the one-dimensional problem we obtained the variance in two 
different forms and showed that their equality was an example of 
Poisson's formula. We can do this here also. 

Denote by S N the square whose centre is at the origin and whose 
sides have lengths equal to 27V + 1 and are parallel to the axes. This has 
area (2Af + 1) 2 and contains (2N + 1) 2 points of the lattice. Round each 
point of the lattice inside SN we construct a circle of radius /?. If two 
circles have their centres at a distance t apart which is less than 2R they 
will overlap in a region which we may call a "lune" and whose area is 
easily found to be 
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{cos-(*/2fl) 
sin 2 0</0. 
o 

= R*{cQ*-*t(2R)-*-t(2R)-*[l-t*(4R*)- 1 ]*}. (5.25) 
Let P be a point inside the square S N and suppose that it is covered 
by n circles. To each circle we ascribe a density | (n- 1) at P if P is 
covered by n 1 other circles. If we integrate this density over the whole 
of the circle, we obtain half the sum of the areas of all lunes formed by 
this circle and each of the other (2N + 1) 2 1 circles. Thus the total 
density at P, which is the sum of the densities corresponding to each of 
the n circles which cover P, is \n(n- 1), and this is the total number of 
different lunes formed by pairs chosen out of the (2N -fl) 2 circles which 
cover P. Hence the integral of the total density over the square S N is 
the sum of the contents of all lunes, or parts of lunes, which are con- 
tained in S N . We denote this by M. Write p n (N) for the probability 
that a point chosen at random in the square S# is covered by n circles. 
This is equal to the proportion of this square which is covered by 
n circles. Then 

CO 

M = (2N+1)* X n(n-l)p n (N), (5.26) 

H=0 

which is, in fact, only a finite series. As N > oo, p n (N) will tend to 
a constant p n which is the relative fraction of the whole space common 
to n circles. 

5.8 We now find M in a different way. Let m be a vector from the 
origin of the lattice to a lattice point and | m | its length. For fixed R 
there is only a finite number of different sizes of lune, those for which 
^ t = | m | ^ 2R. For t > let P(t) be the number of different m 
for which | in | = t. Each circle will contain P(t) lunes corresponding 
to values of t < 2/?, and the total content of all lunes in a circle will be 
I, P(t)V(R,t) = VV(R,\m\) (5.27) 

0<t<2R m 

where the second sum is taken over all lattice points except (0, 0), and 
the convention is made that V(R, \ m |) = for | m | > 2R. There are 
(2N + 1) 2 circles with centres in S N , and each lune is defined by two of 
these so that 

lim (2W + 1)- 2 

N > oo m 

= S i(-l) p n . (5.28) 

n = 

On the other hand the sum of the areas of all circles or parts of circles 
inside S N may be written M', so that 

* = lim 
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This must be equal to the integral over S N of w, the number of spheres 
covering a point P, and this can be written as 



S np n = 

= V(R, 0). (5.29) 

Hence (5.28) can be written 

2n*p n = ZV(R,\m\), (5.30) 

1 m 

where the sum is now taken over all lattice points including the origin. 
Consider a point (a,/^), where (O^a^l, 0^/3^1). The num- 
ber of lattice points contained in a circle with this centre and radius R 
is equal to the number of circles of radius R and centres on the lattice 
points, which cover (a, /9), i.e. to the value of n for this point. Hence the 
distribution of the number of lattice points in a random circle of radius 
R is equal to the distribution of n when (a, /?) is chosen at random in the 
unit square. The variance is therefore 

np n V 



m|). (5.31) 

m 

This must also equal (5.5), and so we derive the equation 



= n (I- nR*) + V r(n) {cos - 1 n' (2R) -> - w* (2R) - 1 [1 - n (4/? 2 ) ~ l ]> }, 

o < n < x 

(5.32) 
which is a curious and unexpected example of Poisson's formula. 

5.9 The oddity about these results is that var(Af) is a sharply oscillat- 
ing function of R. In practical problems we must therefore calculate it 
exactly (which is probably best done using the finite series (5.31)), or else 
use the upper bound Ra 2 given by (5.24)). Some generalizations of these 
results have been given by Kendall and Rankin to oval curves, to hyper- 
spheres and hyperellipses, and to a hexagonal lattice. The results for 
oval curves contribute something to our knowledge of the method of 
"counting squares " in graphical integration, and those for circles thrown 
on hexagon lattices are useful in the theory of cutaneous sensations. The 
results are also of some interest in connection with estimating plant 
numbers in an area by throwing a hoop (or drawing a random circle) 
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in cases where the plants are not distributed in a Poisson field but have 
a more or less regular pattern. 

Random shapes 

5.10 In the above problem we have considered the case of a random 
set (the circle), which has a fixed shape, covering a fixed set (the lattice 
points). A wide class of problems of a different kind arises when we con- 
sider sets which are not only random in position but also in their geo- 
metric character. Two examples will illustrate the ideas involved. In 
the first we consider n intervals of unit length on the line oo < x < oo 
which are such that their centres x t are independently distributed with 
a distribution function F(x). The set of points covered by one or more 
of these intervals is then a random set. 

In the second example we consider n circles of unit radius whose 
centres fall at random inside a rectangle. The set is defined to consist 
of all those points inside the rectangle which are covered by one or more 
circles. In both cases we would like to determine the distribution of the 
measure of the set. 

Robbins' theorem on random sets 

5.11 Robbins (1944, 1945) (see also Takacs, 1958) has given a general 
theorem which makes it possible to obtain the moments. This theorem 
is contained, in substance, in earlier results of Kolmogoroff (1950) on 
conditional expectations, but here we follow Robbins' exposition as 
being more directly relevant to the geometric problem. 

We consider Lebesgue measurable sets, X y in an w-dimensional 
Euclidean space R n . We suppose that in the space, T, of all such sets 
there is a probability measure which we denote formally by p(X) y so 
that we can write 

prob{X'mS} = f C fi (X)dp(X) f (5.33) 

J T 

where S is a /o-measurable set of sets X y and C$ (X) is a funccion of the 
set X equal to unity if X lies in the set S and equal to zero elsewhere. 
In all cases we need consider, the set X will be defined by a finite num- 
ber of parameters, so that T can be taken as a Euclidean space with r, 
say, parameters O ly ...,0 r . Then the above integral will be a Lebesgue- 
Stieltjes integral over this space. In the above examples the space T 
could be taken as defined by the n coordinates of the centres of the 
intervals in the first case, and the 2n coordinates of the centres of the 
circles in the second case, remembering that there are then n \ points in 
the parameter space which correspond to the same geometrical con- 
figuration. 
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We define a function, g (x, X) 9 of points x in R n and sets X in 7 1 , 
such that 

g (x, X ) = 1, if x belongs to X, 

= 0, otherwise. (5.34) 

Then for fixed X, the Lebesgue integral of g(x, X) over R n is equal to 
the Lebesgue measure of X. For fixed x y the probability that x is 
covered by X will be given by the integral of g(x, X) over the space T. 
Denote the Lebesgue measure of X by /i (X), and suppose that g (x, X) 
is measurable with respect to the product measure in the product space 
R n x T. This will be true in all applications. Denote the differential 
element of the product measure by d^p(x,X}. Then, using Fubini's 
theorem, we have 

g(x,X)dpp(x,X) = f {f g (*,X)dp(X)}dp(x). (5.35) 

RnXT J Rn{J T J 

The inner integral on the right-hand side is the probability that a ran- 
dom set X covers a given point x, which we may write as P(xeX). 
Using Fubini's theorem again, we also have 



/fnXT 



so that 



X) = \ {[ g(x t X)d ft (x)\dp(X). 

JTUfln J 

= f p(X)dp(X), (5.36) 

J r 

f f*(X)dp(X) = f P(xeX)dn(x). (5.37) 

J T J R n 

This means that the expected value of the measure of X is equal to 
the integral over R n of the probability P(xeX). The intuitive idea 
behind this result is simple. If the space R n could be divided up into 
a large number of equal elements so small that each can be regarded as 
either belonging to X or not, the expected measure of X will be the 
expected number of these which lie in X multiplied by their common 
measure. Since the expectation of a sum is equal to the sum of the ex- 
pectations, even when the variates concerned are dependent, the ex- 
pected number lying in X will be equal to the sum over all elements 
of the expectation that each lies in X. 

5.12 It is important to extend the above result in such a way that higher 
moments of ju (X) can be found. Write p (x ly . . ., # m ) for the probability 
that x ly . . ., x m all belong to X. Then 



f P(xi:,x m )dn(x ly ...,x m )= f 

J Rn J 

. (5.38) 
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5.13 We can now apply these results to various examples of which the 
simplest concern random intervals on a line. In the first (Robbins, 
1944) we take a fixed interval (0, 1) and N random intervals of length 
a < 1 whose centres are uniformly distributed on the interval ( |fl> 
1 -f-|#). X is defined to be the common part of the interval (0, 1) with 
the set sum of the N intervals. Then the probability that any point x 
on (0, 1) is covered by a particular interval is 



If // (X) is the measure of X, we then have 



= 1 -(!-/>)* (5.39) 

In a similar way E[JU (X) 2 ] can be found and turns out to be somewhat 
complicated. A similar problem was considered by Votaw( 1946). Here 
we again have N intervals whose centres are uniformly distributed on 
an interval, and X is defined to be the set sum of all these intervals. In 
this case it is possible to find the complete distribution of // (X). Votaw 
also obtains the probability that X contains the whole of the interval. 
This is a complicated expression which changes its form at a number of 
points and is very similar to the solution to Stevens' and Fisher's prob- 
lem (compare (2.38)). In the latter, however, the basic interval on which 
the centres are distributed is wrapped around a circle, so that we have 
TV random arcs of fixed size, distributed on the circumference of a circle. 
Robbins' theorem could again be applied to obtain the moments of the 
part of the circle covered by the arcs. 

A problem in virology 

5.14 A much more difficult problem, which is due to Professor Fazekas 
de St. Groth, arises in virology. A virus particle which is taken to be a 
sphere (whose radius might be 50 juju) is attacked by antibodies which 
we represent, to a first approximation, as cylinders of length about 37 ^ 
and effectively negligible thickness. These attach themselves to the virus 
particles, standing up perpendicularly from the surface. We suppose 
that N such antibodies are stuck rigidly on the sphere, their points of 
attachment being distributed independently and at random on the sur- 
face. A single such antibody will prevent the virus particle from sticking 
to a plane surface (in fact a cell) at any point on a circular cap shaded by 
the antibody. This cap will be a circle whose angular radius is 55 and 
the caps can therefore be regarded as random circles distributed uni- 
formly over the surface of the sphere and possibly overlapping. Given 
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N (or more generally given the distribution of N), it is required to find 
the probability that the whole of the surface is shaded. 

5.15 No exact solution to this problem is known. If the angular radius 
of the cap is 0, it is not even known, in general, what is the smallest value 
of N, as a function of 0, which is such that the sphere can be completely 
covered (however, this is known for certain particular values of 6). 
Although it does not seem possible to obtain an analytical expression 
for the probability that every point of the sphere is covered, it is pos- 
sible to obtain the first and second moments of the area not covered, 
both when N is fixed and when it has a probability distribution such 
as the Poisson. 

Suppose the spherical area occupied by the cap is A, where 4n is the 
whole area of the sphere. Then, using Robbins' theorem, the expected 
area not covered is 

4n(l-A(4n)- l ) N . (5.40) 

The second moment can be obtained in the same way, but is more com- 
plicated since it is necessary to obtain the probability that two points, 
separated by an angular distance 0, are not covered. This can be found 
without much difficulty. From the first two moments it is possible, by 
making certain further assumptions, to find an approximation for the 
probability that the sphere is completely covered. It is hoped to publish 
a detailed discussion of this problem at a later date (compare Fazekas 
and Moran, Biometrika (1962), 49). 

5.16 Generalizations of Robbins' results on random intervals have been 
obtained by Robbins himself (1945), Bronowski and Neyman (1945), 
Garwood (1947), and Santalo (1947). The first four of these writers 
consider a fixed rectangle of sides #, b and random parallel rectangles 
of sides a, /J whose centres fall in a rectangle concentric with the fixed 
rectangle and with sides a -fa and b +/?. They obtain the mean and 
variance of the measure of the set common to the fixed set, and the set- 
sum of the random intervals. Robbins also obtains the ^-dimensional 
generalization of this, and Santalo considers the case where the random 
rectangles also have random orientation. 

Random circles on a square 

5.17 As a further example we consider circles C, falling at random on 
a square (Garwood, 1947). Suppose that there are N of these, that they 
have radius 0, and that their centres are independently and uniformly 
distributed over a region T consisting of all points in the plane whose 
distance from the square, which we call A and take to have unit side, 
is not greater than a. The area of T is 1 +4a -f rca 2 , and the area of each 
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C is na 2 . The probability of any point of the unit square being covered 
by at least one circle is clearly 

which is therefore equal to the expected area covered. To obtain the 
second moment consider two points P^ and P 2 inside A. With a suit- 
able coordinate system these can be represented by (#i,j>i), (x^y^ 
where < x ly y ly x 2y y 2 < 1. The probability that neither of these 
points is covered is q N (r), where q(r) is the fraction of T not inside 
either of the circles with radius a and centres Pj and P 2 . Since P 1 and 
P 2 are both inside A, this fraction is a function only of r, the distance 
between P x and P 2 . The maximum value of r is \/2, and if <j> (r) is the 
probability distribution of r we have 



f 

= 

J 



q(r)i(r)dr. (5.42) 



Write fi(r) for the area common to two circles of radius a whose 
centres are distant r apart. Then 

}(r) = 0, if r>2a, 

= 2a 2 (0- sinO cos 0), ifr < 2a, (5.43) 

where 6 = cos~ l r(2a)- 1 . 

Then 



- 

We now have to find <f>(r). We have 



The quantity = (#! a? 2 1 clearly has a distribution with probability 
density equal to 

2(1-|) 

for < f ^ 1, and therefore u = (^i-^ 2 ) a = 2 has the distribution 

l -^du. (5.45) 

Vw 

^ = (yiyz) 2 has the same distribution, and the distribution of 
r = \/( w +^) is found by integrating 



over the part of the region r 2 < u +v < (r +dr) 2 which is contained in 
the unit square. The probability density is then found to be 
G.P. H 
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<f>(r) = 2r(?z-4r +r 2 ), for < r < 1, 

= 2r{4sin- 1 r- 1 +4V(r 2 -l)--r 2 -7r-2}, forl < r < 2*. (5.46) 

Inserting (5.44) and (5.46) in (5.42) we obtain E(Y 2 ) by numerical 
quadrature. 

5.18 In a similar way we can consider circles on a fixed circle. Thus 
we take a fixed circle A of unit radius and suppose that N random 
circles of diameters r(r < 1) are such that their centres are distributed 
at random inside a circle concentric with A and having radius 1 +r. 
The second moment is then of the form 

E(Y*) = \ l q N (r)4(r)dr, (5.47) 

J 

where q(r) is given by (5.18) and ^(r) is the probability density of the 
distribution of the distance between two points taken at random inside 
the circle A. This distribution has already been obtained in Chapter 2. 

If N were a random variable having some discrete distribution, the 
first and second moments can be obtained simply by averaging over N. 
In particular, if N has a Poisson distribution we may imagine circles to 
be distributed at random on the plane in a Poisson field with some given 
density. If they touch the area A, their centres must lie in or on the 
region T and are uniformly distributed over T. This justifies the choice 
of the particular form of T, quite apart from the fact that it also gives 
the simplest results. 

Robbins (1945) and Garwood (1947) have solved the problem of 
random circles on a rectangle, and this has been generalized to n 
dimensions by Santalo (1947). 

5.19 A somewhat different class of problems arises when we suppose 
that the random covering set is not uniformly distributed over some 
region but has some other distribution. An example of this, discussed 
by Morgenthaler (1961) see also Solomon (1958) is that of a circle of 
radius R whose centre has a circular bivariate normal distribution about 
a point O. The random set is defined to be the common part of this 
circle with a circle of radius r and centre O. If A(x,y) is the overlap 
of the circles when the first circle has its centre at (x,y) and this point 
has a circular normal distribution with standard deviation a, the ex- 
pected common area is 



r A( X ,y)exp(-l-( X *+y*)}d X dy. (5.48) 

- co J - oo V. ZCT ) 



Since A(x,y) is a function of x 2 -fy 2 only, this can be obtained by 
numerical integration. It can also be expressed in terms of the "offset 
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circular probability function" p(R y r). This is the integral of a circular 
normal distribution, with unit standard deviation, over a circle of radius 
R whose centre is distant r from the mean of the distribution. Thus 



p(R,r) = (2)-' f f 

J J x* 

where a 2 -hi 2 = r 2 . We can write this as 



1 r2n rR 

=- 

^Jo Jo 



/e -* (ri + tt ' ) / (r)rfii > (5.49) 



where / (#) is the zero order modified Bessel function of the first kind. 
Using the facts that 

I' Q (*) = I, (*), I, (*) +zl[ (*) = */ (*), (5.50) 

where /i(Z) is the similar function of the first order, we can transform 
the above integral into 



_ e 



R ( ' 

Jo 



= l-q(R,r), say. (5.51) 

Tables of q(R,r) have been prepared by the Rand Corporation. 

A problem concerning small particles 

5.20 A very interesting class of problems of overlap has been con- 
sidered by Armitage (1949) and Mack (1954). These arise in the count- 
ing of dust particles on a sampling plate. The number of these in a given 
area may be underestimated because of overlapping, and we wish to cal- 
culate the expected number of "clumps" of various sizes, a clump being 
defined as a number of overlapping particles. 

The general theory, which we consider first, is due to Mack. We 
suppose that each particle is represented by a convex figure placed at 
random on an area A. Let the area a y and perimeter s, of each figure 
have some given joint distribution, and suppose that the figures are dis- 
tributed in a Poisson field with density L The most convenient way to 
define the latter condition exactly is to choose some definite point inside 
each figure which we shall call the "centre", and suppose that these 
centres are distributed in a Poisson field. 

5.21 For simplicity of exposition we consider not a randomly chosen 
number of figures whose areas and perimeters have a probability dis- 
tribution, but N figures whose centres are to lie at random in an area A. 
Of these N figures N r are to have areas a r and perimeters s r (r = 1,2, 
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...,&), where N = SAT r . We then consider the number of clumps 
formed by figures whose centres lie in an area a contained in A. The 
previous problem can then be solved by letting A tend to infinity whilst 
keeping NA ~ l fixed and equal to L 

We define a clump to consist of i figures which overlap, and are not 
overlapped by any other figure, where i = 1,2, ..., so that we include 
single figures as well. Let C be the expected total number of clumps, 
and C i the expected number with / figures. Then 

2C, = C, (5.52) 

v z c z = NA~\ (5.53) 

We shall prove that for A&~ 1 and TV large, 

C = *A -i L N r exp - 4 S NJ> W , (5.54) 

C 1 = a^^XA^exp-SA^^, (5.55) 

ft = aexp-S N u a u A~ l y (5.56) 

where ^4i m . = a n +#,, +$ M $ r (2:rc)- 1 , and /? is the expected area of a not 

covered by figures. 

The expected number of figures with centres in some small region 
dot of a, which are not covered by other figures, is p lt <5oc, where p u is 
the probability that a given figure is not covered by other figures. Since 
the expectation of a sum is the sum of the expectations, the expected 
number of single figures in a is />na. We must therefore evaluate /> n . 
Suppose that the figure is of type i with area a { and perimeter s t . Let 
its function of support, relative to an origin and axis fixed in itself, be 
Hi (0), and suppose that it has orientation ^ x . Then its function of sup- 
port is Hi(0 +</>i), where (f> l can be supposed to be randomly and uni- 
formly distributed on the interval (0, 2n). For <f> l fixed, consider the 
probability that a specified figure of type j, with function of support 
Hj (0) and orientation < 2 , does not overlap the first figure. This proba- 
bility is 1 A tj A~ l where A^ is the area of a figure whose function of 
support is H t (6 +<i) +Hj (6 +^ 2 )- 

We know that the perimeter and area of a convex figure, with func- 
tion of support h (0), are given by 

rtn 

h(e)do 

J 

and 

J o 
respectively. The expectation of the area A^ is therefore 

r27i r2n r2n 
JO JO JO 
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) dd, d0 2 

o J o 

2 ) dO, dd 2 . (5.57) 



J 

P f 2 *tf : 
Jo J o 



The last term is clearly zero, and so 
We therefore have 



l-^'- 1 n (l-ft w )* (5.58) 

i 1*i 

and hence (5.55) is true in the limit. 

5.22 To calculate C we associate with each clump the point P which 
is that centre of one of its figures which lies furthest to the right. We 
can clearly ignore cases where there are two or more such points since 
their probability is zero. Such a point P is called a right-hand centre. 
The probability that a right-hand centre falls in a small region of area 
<5oc will be p l <5a + o ((5a 2 ), where p l is to be determined. The probability 
that a right-hand centre P falls in such a region will be the probability 
that a figure has its centre in this region, and that no other figure over- 
lapping it has a centre further to the right. Let the area to the right of 
P in which the centres of other figures are not allowed to fall have ex- 
pectation F L , and define F R similarly for left-hand centres. Then 
F L = F R and F L +F R = E(A fi ) so that (5.54) follows. (5.56) can be 
proved directly from Robbins' theorem. 

Armitage (1949) studied this problem in the particular cases where 
the figures are circles and rectangles and obtained approximate formulae 
by a quite different method. His results can be derived from Mack's 
by inserting the particular values of the a t and s t . 

Lancaster's problem 

5.23 A different crowding problem has been considered by Lancaster 
(1950). Red blood cells in a suspension are counted by spreading a thin 
film of the blood on a glass slide marked in squares, and counting the 
number of red cells in each square. If the cells are distributed indepen- 
dently at random the numbers in each square should be distributed in 
Poisson distributions. This can be tested for by comparing the observed 
variance of the counts with their mean, since these should be equal for 
a Poisson distribution. In practice the variance tends to be rather less 
than the mean, the ratio declining from unity as the mean increases. One 
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cause of this is that cells in the denser squares may tend to be pushed 
into the less dense squares. Assuming that the cells are smooth circular 
disks which fall out of the suspension on to the glass slide, we may 
assume that any cell which arrives unhindered on the slide stays there, 
but that any cell which falls on top of a cell already on the slide, slides 
sideways along a radius of the latter until it too lies flat on the glass. 
Using this model it is possible to calculate the probability that a cell 
falling out of suspension in a given square is pushed over the boundary 
into another square, and from this an approximation to the true variance 
of the number appearing in any square. 

5.24 Another class of problems considered by Mack (1948, 1949, 
1953) see also Berg (1945) is also closely related to the theory of 
coverage. Suppose that n points are independently distributed at ran- 
dom over a region which may be one-, two- or three-dimensional. A 
/{-aggregate with respect to a figure of defined size, shape, and orienta- 
tion is defined to be a set of k points which can be covered by such a 
figure without covering any other point. Mack obtains the expected 
number of such aggregates in the one-dimensional case where the cover- 
ing figure is an interval, and in the two-dimensional case where the 
covering figures are squares, parallelograms, triangles, and circles, and 
he generalizes some of these results to three dimensions. 

5.25 It is evident that further classes of coverage problems can be 
defined. Mathematically they are not simple and may have to be re- 
solved by sampling experiments. The subject is of relatively recent 
growth, however, and it appears likely that the emergence of practical 
problems of the type we have described will give an impetus to con- 
siderable further developments. 
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